Abstract

We study stochastic differential equations (SDEs) whose drift and diffusion coefficients are path-dependent and controlled. We construct a value process on the canonical path space, considered simultaneously under a family of singular measures, rather than the usual family of processes indexed by the controls. This value process is characterized by a second order backward SDE, which can be seen as a non-Markovian analogue of the Hamilton-Jacobi-Bellman partial differential equation. Moreover, our value process yields a generalization of the G-expectation to the context of SDEs.

1 Introduction

We consider a controlled stochastic differential equation (SDE) of the form

$$X_t = x + \int_0^t \mu(r, X, \nu_r)\,dr + \int_0^t \sigma(r, X, \nu_r)\,dW_r, \quad 0 \le t \le T, \tag{1.1}$$

where $\nu$ is an adapted control process, $W$ is a Brownian motion and the Lipschitz-continuous coefficients $\mu(r, X, \nu_r)$ and $\sigma(r, X, \nu_r)$ may depend on the past trajectory $\{X_s,\ 0 \le s \le r\}$ of the solution. Denoting by $X^\nu$ the solution corresponding to $\nu$, we are interested in the stochastic optimal control problem

$$V_0 := \sup_{\nu} E[\xi(X^\nu)], \tag{1.2}$$

where $\xi$ is a given functional. The standard approach to such a non-Markovian control problem (cf. [9, 10]) is to consider for each control $\nu$ the associated value process

$$J^\nu_t = \operatorname*{ess\,sup}_{\tilde\nu:\ \tilde\nu = \nu \text{ on } [0,t]} E\big[\xi(X^{\tilde\nu}) \,\big|\, \mathcal{F}_t\big], \tag{1.3}$$

where $(\mathcal{F}_t)$ is the given filtration. The dependence on $\nu$ reflects the presence of a forward component in the optimization problem.

The situation is quite different in Markovian optimal control (cf. [12]), where one uses a single value function which depends on certain state variables but not on a control. This is essential to describe the value function by a differential equation, such as the Hamilton-Jacobi-Bellman PDE, which is the main merit of the dynamic programming approach. It is worth noting that this equation is always backward in time. An analogous description for (1.3) via backward SDEs (BSDEs, cf. [20]) is available for certain popular problems such as utility maximization with power or exponential utility functions (e.g., [13, 18]) or drift control (e.g., [10]). However, this relies on a very particular algebraic structure which allows for a separation of $J^\nu$ into a backward part independent of $\nu$ and a forward part depending on $\nu$.

In this paper, we consider the problem (1.2) on the canonical space by recasting it as

$$V_0 = \sup_{\nu} E^{P^\nu}[\xi(B)], \tag{1.4}$$

where $P^\nu$ is the distribution of $X^\nu$ and $B$ is the canonical process, and we describe its dynamic value by a single value process $V = \{V_t(\omega)\}$. Formally, $V$ corresponds to a value function in the Markovian sense if we see the whole trajectory of the controlled system as a state variable. Even though (1.2) has features of coupled forward-backward type, the value process is defined in a purely backward manner: one may say that by constructing $V$ on the whole canonical space, we essentially calculate the value for all possible outcomes of the forward part. An important ingredient in the same vein is that $V$ is defined "quasi-surely" under the family of mutually singular measures $\{P^\nu\}$. Rather than forming a family of processes as in (1.3), the necessary information is stored in a single process which is defined on a "large" part of the probability space; indeed, the process $V$ "seen under $P^\nu$" should be thought of as an analogue of $J^\nu$. Clearly, this is a necessary step to obtain a (second order) backward SDE. We remark that [22] considered the same control problem (1.2) and also made a connection to nonlinear expectations. However, in [22], the value process was considered only under the given probability measure.

We first consider a fairly regular functional $\xi$ and define $V_t(\omega)$ as a conditional version of (1.4). Applying and advancing ideas from [28] and [17], regular conditional probability distributions are used to define $V_t(\omega)$ for every $\omega$ and prove a pathwise dynamic programming principle (Theorem 3.2). In a second step, we enlarge the class of functionals $\xi$ to an $L^1$-type space and prove that the value process admits a (quasi-sure) càdlàg modification (Theorem 5.1).

We also show that the value process falls into the class of sublinear expectations studied in [19]. Indeed, if $\xi$ is considered as a random variable on the canonical space, the mapping $\xi \mapsto V_t$ can be seen as a generalization of the G-expectation [23, 24], which, by [7], corresponds to the case $\mu \equiv 0$ and $\sigma(r, X, \nu_r) = \nu_r$, where the SDE (1.1) degenerates to a stochastic integral. Moreover, $V_t$ can be seen as a variant of the random G-expectation [17]; cf. Remark 6.5.

Finally, we characterize $V$ by a second order backward SDE (2BSDE) in the spirit of [19]; cf. Theorem 6.3. The second order is clearly necessary since the Hamilton-Jacobi-Bellman PDE for the Markovian case is fully nonlinear, while ordinary BSDEs correspond to semilinear equations. 2BSDEs were introduced in [5], and in [27] for the non-Markovian case. We refer to [27] for the precise relation between 2BSDEs in the quasi-sure formulation and fully nonlinear parabolic PDEs.

We remark that our approach is quite different from the (backward) stochastic partial differential equations studied in [21] for a similar control problem (mainly for uncontrolled volatility) and in [14, 15, 2, 3, 4] for so-called pathwise stochastic control problems. The relation to the path-dependent PDEs, introduced very recently in [25], is yet to be explored.

The remainder of the paper is organized as follows. In Section 2 we detail the controlled SDE and its conditional versions, define the value process for the case when $\xi$ is uniformly continuous and establish its regularity. The pathwise dynamic programming principle is proved in Section 3. In Section 4 we extend the value process to a more general class of functionals $\xi$ and state its quasi-sure representation. The càdlàg modification is constructed in Section 5. In the concluding Section 6 we provide the Hamilton-Jacobi-Bellman 2BSDE and interpret the value process as a variant of the random G-expectation.

2 Construction of the Value Function 

In this section, we first introduce the setting and notation. Then, we define the value function $V_t(\omega)$ for a uniformly continuous reward functional $\xi$ and examine the regularity of $V_t$.

2.1 Notation



We fix a constant $T > 0$ and let $\Omega := C([0,T]; \mathbb{R}^d)$ be the canonical space of continuous paths equipped with the uniform norm $\|\omega\|_T := \sup_{0 \le s \le T} |\omega_s|$, where $|\cdot|$ is the Euclidean norm. We denote by $B$ the canonical process $B_t(\omega) = \omega_t$, by $P_0$ the Wiener measure, and by $\mathbb{F} = \{\mathcal{F}_t\}_{0 \le t \le T}$ the (raw) filtration generated by $B$. Unless otherwise stated, probabilistic notions requiring a filtration (such as adaptedness) refer to $\mathbb{F}$.

For any probability measure $P$ on $\Omega$ and any $(t,\omega) \in [0,T] \times \Omega$, we can construct the corresponding regular conditional probability distribution $P^\omega_t$; cf. [29, Theorem 1.3.4]. We recall that $P^\omega_t$ is a probability kernel on $\mathcal{F}_t \times \mathcal{F}_T$; i.e., $P^\omega_t$ is a probability measure on $(\Omega, \mathcal{F}_T)$ for fixed $\omega$ and $\omega \mapsto P^\omega_t(A)$ is $\mathcal{F}_t$-measurable for each $A \in \mathcal{F}_T$. Moreover, the expectation under $P^\omega_t$ is the conditional expectation under $P$:

$$E^{P^\omega_t}[\xi] = E^P[\xi \mid \mathcal{F}_t](\omega) \quad P\text{-a.s.}$$

whenever $\xi$ is $\mathcal{F}_T$-measurable and bounded. Finally, $P^\omega_t$ is concentrated on the set of paths that coincide with $\omega$ up to $t$,

$$P^\omega_t\{\omega' \in \Omega : \omega' = \omega \text{ on } [0,t]\} = 1. \tag{2.1}$$

While $P^\omega_t$ is not defined uniquely by these properties, we choose and fix one version for each triplet $(t,\omega,P)$.

Let $t \in [0,T]$. We denote by $\Omega^t := \{\omega \in C([t,T]; \mathbb{R}^d) : \omega_t = 0\}$ the shifted canonical space of paths starting at the origin. For $\omega \in \Omega$, the shifted path $\omega^t \in \Omega^t$ is defined by $\omega^t_r := \omega_r - \omega_t$ for $t \le r \le T$, so that $\Omega^t = \{\omega^t : \omega \in \Omega\}$. Moreover, we denote by $P^t_0$ the Wiener measure on $\Omega^t$ and by $\mathbb{F}^t = \{\mathcal{F}^t_r\}_{t \le r \le T}$ the (raw) filtration generated by $B^t$, which can be identified with the canonical process on $\Omega^t$.

Given two paths $\omega$ and $\tilde\omega$, their concatenation at $t$ is the (continuous) path defined by

$$(\omega \otimes_t \tilde\omega)_r := \omega_r 1_{[0,t)}(r) + (\omega_t + \tilde\omega^t_r) 1_{[t,T]}(r), \quad 0 \le r \le T.$$

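Given an $\mathcal{F}_T$-measurable random variable $\xi$ on $\Omega$ and $\omega \in \Omega$, we define the conditioned random variable $\xi^{t,\omega}$ on $\Omega$ by

$$\xi^{t,\omega}(\tilde\omega) := \xi(\omega \otimes_t \tilde\omega), \quad \tilde\omega \in \Omega.$$

Note that $\xi^{t,\omega}(\tilde\omega) = \xi^{t,\omega}(\tilde\omega^t)$; in particular, $\xi^{t,\omega}$ can also be seen as a random variable on $\Omega^t$. Then $\tilde\omega \mapsto \xi^{t,\omega}(\tilde\omega)$ is $\mathcal{F}^t_T$-measurable and moreover, $\xi^{t,\omega}$ depends only on the restriction of $\omega$ to $[0,t]$. We note that for an $\mathbb{F}$-progressively measurable process $\{X_r,\ r \in [s,T]\}$, the conditioned process $\{X^{t,\omega}_r,\ r \in [t,T]\}$ is $\mathbb{F}^t$-progressively measurable. If $P$ is a probability on $\Omega$, the measure $P^{t,\omega}$ on $\mathcal{F}^t_T$ defined by

$$P^{t,\omega}(A) := P^\omega_t(\omega \otimes_t A), \quad A \in \mathcal{F}^t_T, \quad \text{where } \omega \otimes_t A := \{\omega \otimes_t \tilde\omega : \tilde\omega \in A\},$$

is again a probability by (2.1). We then have

$$E^{P^{t,\omega}}[\xi^{t,\omega}] = E^{P^\omega_t}[\xi] = E^P[\xi \mid \mathcal{F}_t](\omega) \quad P\text{-a.s.}$$

Analogous notation will be used when $\xi$ is a random variable on $\Omega^s$ and $\omega \in \Omega^s$, where $0 \le s \le t \le T$. We denote by $\Omega^s_t := \{\omega|_{[s,t]} : \omega \in \Omega^s\}$ the restriction of $\Omega^s$ to $[s,t]$, equipped with $\|\omega\|_{[s,t]} := \sup_{r \in [s,t]} |\omega_r|$. Note that $\Omega^s_t$ can be identified with $\{\omega \in \Omega^s : \omega_r = \omega_t \text{ for } r \in [t,T]\}$. The meaning of $\Omega_t$ is analogous.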
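The shift, the concatenation and the conditioned random variable can be made concrete on a time grid. The following is only a sketch under simplifying assumptions that are not part of the paper: paths are scalar, they are stored as Python lists on a common uniform grid of $[0,T]$, and the time $t$ is represented by a grid index `k`. The function names are hypothetical.

```python
def shift(path, k):
    """Discrete analogue of the shifted path: (path_r - path_t) for r >= t.

    The result lives on the grid of [t, T] and starts at 0.
    """
    return [v - path[k] for v in path[k:]]


def concat(omega, k, omega_tilde):
    """Discrete analogue of the concatenation of omega and omega_tilde at t.

    Before index k the result follows omega; from index k on it equals
    omega_t plus the increments of omega_tilde after t.
    """
    return omega[:k] + [omega[k] + v for v in shift(omega_tilde, k)]


def conditioned(xi, k, omega):
    """Discrete analogue of xi^{t,omega}: omega_tilde -> xi(concat(omega, t, omega_tilde))."""
    return lambda omega_tilde: xi(concat(omega, k, omega_tilde))
```

For instance, with `xi = max` (the running maximum at time $T$), the conditioned functional only sees the increments of its argument after $t$, which is the discrete counterpart of the identity $\xi^{t,\omega}(\tilde\omega) = \xi^{t,\omega}(\tilde\omega^t)$.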

2.2 The Controlled SDE 

Let $U$ be a nonempty Borel subset of $\mathbb{R}^m$ for some $m \in \mathbb{N}$. We consider two given functions

$$\mu : [0,T] \times \Omega \times U \to \mathbb{R}^d \quad \text{and} \quad \sigma : [0,T] \times \Omega \times U \to \mathbb{R}^{d \times d},$$

the drift and diffusion coefficients, such that $(t,\omega) \mapsto \mu(t, X(\omega), \nu_t(\omega))$ and $(t,\omega) \mapsto \sigma(t, X(\omega), \nu_t(\omega))$ are progressively measurable for any continuous adapted process $X$ and any $U$-valued progressively measurable process $\nu$. In particular, $\mu(t,\omega,u)$ and $\sigma(t,\omega,u)$ depend only on the past trajectory $\{\omega_r,\ r \in [0,t]\}$, for any $u \in U$. Moreover, we assume that there exists a constant $K > 0$ such that

$$|\mu(t,\omega,u) - \mu(t,\omega',u)| + |\sigma(t,\omega,u) - \sigma(t,\omega',u)| \le K \|\omega - \omega'\|_t \tag{2.2}$$

for all $(t,\omega,\omega',u) \in [0,T] \times \Omega \times \Omega \times U$. We denote by $\mathcal{U}$ the set of all $U$-valued progressively measurable processes $\nu$ such that

$$\int_0^T |\mu(r, X, \nu_r)|\,dr < \infty \quad \text{and} \quad \int_0^T |\sigma(r, X, \nu_r)|^2\,dr < \infty \tag{2.3}$$

hold path-by-path for any continuous adapted process $X$. Given $\nu \in \mathcal{U}$, the stochastic differential equation

$$X_t = x + \int_0^t \mu(r, X, \nu_r)\,dr + \int_0^t \sigma(r, X, \nu_r)\,dB_r, \quad 0 \le t \le T \quad \text{under } P_0$$

has a $P_0$-a.s. unique strong solution for any initial condition $x \in \mathbb{R}^d$, which we denote by $X(0,x,\nu)$. We shall denote by

$$\bar{P}(0,x,\nu) := P_0 \circ X(0,x,\nu)^{-1} \tag{2.4}$$

the distribution of $X(0,x,\nu)$ on $\Omega$ and by

$$P(0,x,\nu) := P_0 \circ \big(X(0,x,\nu)^0\big)^{-1}$$

the distribution of $X(0,x,\nu)^0 \equiv X(0,x,\nu) - x$; i.e., the solution which is translated to start at the origin. Note that $P(0,x,\nu)$ is concentrated on $\Omega^0$ and can therefore be seen as a probability measure on $\Omega^0$.
We shall work under the following nondegeneracy condition.

Assumption 2.1. Throughout this paper, we assume that

$$\overline{\mathbb{F}^X}^{P_0} \supseteq \mathbb{F} \quad \text{for all } X = X(0,x,\nu), \tag{2.5}$$

where $\overline{\mathbb{F}^X}^{P_0}$ is the $P_0$-augmentation of the filtration generated by $X$ and $(x,\nu)$ varies over $\mathbb{R}^d \times \mathcal{U}$.

One can construct situations where Assumption 2.1 fails. For example, if $x = 0$, $\sigma \equiv 1$ and $\mu(r, X, \nu_r) = \nu_r$, then (2.5) fails for a suitable choice of $\nu$; see, e.g., [?]. The following is a positive result which covers many applications.

Remark 2.2. Let $\sigma$ be strictly positive definite and assume that $\mu(r, X, \nu_r)$ is a progressively measurable functional of $X$ and $\sigma(r, X, \nu_r)$. Then (2.5) holds true.

Note that the latter assumption is satisfied in particular when $\mu$ is uncontrolled; i.e., $\mu(r,\omega,u) = \mu(r,\omega)$.

Proof. Let $X = X(0,x,\nu)$. As the quadratic variation of $X$, the process $\int \sigma\sigma^\top(r, X, \nu_r)\,dr$ is adapted to the filtration generated by $X$. In view of our assumptions, it follows that

$$M := \int \sigma(r, X, \nu_r)\,dB_r = X - x - \int \mu(r, X, \nu_r)\,dr$$

has the same property. Hence $B = \int \sigma(r, X, \nu_r)^{-1}\,dM_r$ is again adapted to the filtration generated by $X$. □
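The last step of this proof, $B = \int \sigma^{-1}\,dM$ with $M = X - x - \int \mu\,dr$, has an exact counterpart for an Euler discretization. The sketch below assumes the scalar case and takes the values of $\mu$ and $\sigma$ along the path as given inputs; in the proof they are obtained from $X$ itself (the diffusion part via the quadratic variation), which a finite grid cannot reproduce. The function names are hypothetical.

```python
def build_path(x0, mu_vals, sigma_vals, noise_increments, dt):
    """Euler step X_{k+1} = X_k + mu_k * dt + sigma_k * dB_k for given coefficients."""
    path = [x0]
    for m, s, dB in zip(mu_vals, sigma_vals, noise_increments):
        path.append(path[-1] + m * dt + s * dB)
    return path


def recover_noise(path, mu_vals, sigma_vals, dt):
    """Invert the Euler step: dM = dX - mu * dt, then dB = dM / sigma (sigma must be nonzero)."""
    increments = []
    for k in range(len(path) - 1):
        dM = path[k + 1] - path[k] - mu_vals[k] * dt
        increments.append(dM / sigma_vals[k])
    return increments
```

The division by `sigma_vals[k]` is where the nondegeneracy of $\sigma$ is used: if the diffusion coefficient vanishes, the noise increment cannot be read off from the solution.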

Remark 2.3. For some applications, in particular when the SDE is of geometric form, requiring (2.5) to hold for all $x \in \mathbb{R}^d$ is too strong. One can instead fix the initial condition $x$ throughout the paper; then it suffices to require (2.5) only for that $x$.

Next, we introduce for fixed $t \in [0,T]$ an SDE on $[t,T] \times \Omega^t$ induced by $\mu$ and $\sigma$. Of course, the second argument of $\mu$ and $\sigma$ requires a path on $[0,T]$, so that it is necessary to specify a "history" for the SDE on $[0,t]$. This role is played by an arbitrary path $\eta \in \Omega$. Given $\eta$, we define the conditioned coefficients

$$\mu^{t,\eta} : [0,T] \times \Omega^t \times U \to \mathbb{R}^d, \quad \mu^{t,\eta}(r,\omega,u) := \mu(r, \eta \otimes_t \omega, u),$$

$$\sigma^{t,\eta} : [0,T] \times \Omega^t \times U \to \mathbb{R}^{d \times d}, \quad \sigma^{t,\eta}(r,\omega,u) := \sigma(r, \eta \otimes_t \omega, u).$$

(More precisely, these functions are defined also when $\omega$ is a path not necessarily starting at the origin, but clearly their value at $(r,\omega,u)$ depends only on $\omega^t$.) We observe that the Lipschitz condition (2.2) is inherited; indeed,

$$|\mu^{t,\eta}(r,\omega,u) - \mu^{t,\eta}(r,\omega',u)| + |\sigma^{t,\eta}(r,\omega,u) - \sigma^{t,\eta}(r,\omega',u)| \le K \|\eta \otimes_t \omega - \eta \otimes_t \omega'\|_{[t,r]} = K \|\omega^t - \omega'^t\|_{[t,r]} \le 2K \|\omega - \omega'\|_{[t,r]}.$$

We denote by $\mathcal{U}^t$ the set of all $\mathbb{F}^t$-progressively measurable, $U$-valued processes $\nu$ such that $\int_t^T |\mu(r, X, \nu_r)|\,dr < \infty$ and $\int_t^T |\sigma(r, X, \nu_r)|^2\,dr < \infty$ for any continuous $\mathbb{F}^t$-adapted process $X = \{X_r,\ r \in [0,T]\}$. For $\nu \in \mathcal{U}^t$, the SDE

$$X_s = \eta_t + \int_t^s \mu^{t,\eta}(r, X, \nu_r)\,dr + \int_t^s \sigma^{t,\eta}(r, X, \nu_r)\,dB^t_r, \quad t \le s \le T \quad \text{under } P^t_0 \tag{2.6}$$

has a unique solution $X(t,\eta,\nu)$ on $[t,T]$. Similarly as above, we define

$$P(t,\eta,\nu) := P^t_0 \circ \big(X(t,\eta,\nu)^t\big)^{-1}$$

to be the distribution of $X(t,\eta,\nu)^t \equiv X(t,\eta,\nu) - \eta_t$ on $\Omega^t$. Note that this is consistent with the notation $P(0,x,\nu)$ if $x$ is seen as a constant path.

2.3 The Value Function 

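We can now define the value function for the case when the reward functional $\xi$ is an element of $\mathrm{UC}_b(\Omega)$, the space of bounded uniformly continuous functions on $\Omega$.

Definition 2.4. Given $t \in [0,T]$ and $\xi \in \mathrm{UC}_b(\Omega)$, we define the value function

$$V_t(\omega) := V_t(\xi;\omega) := \sup_{\nu \in \mathcal{U}^t} E^{P(t,\omega,\nu)}[\xi^{t,\omega}], \quad (t,\omega) \in [0,T] \times \Omega. \tag{2.7}$$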
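As a rough illustration of a supremum of this kind at $t = 0$, consider the special case $\mu \equiv 0$, $\sigma(r, X, \nu_r) = \nu_r$ in dimension one with a reward of the form $\xi = f(B_T)$. The sketch below restricts the supremum to a finite set of constant controls, which is an assumption made for the example only and in general yields just a lower bound for $V_0$. The expectation is computed by a trapezoidal rule rather than by simulation; all function names are hypothetical.

```python
import math


def gaussian_expectation(f, mean, std, n=4001, width=8.0):
    """Approximate E[f(mean + std * Z)], Z ~ N(0, 1), by the trapezoidal rule on [-width, width]."""
    h = 2.0 * width / (n - 1)
    total = 0.0
    for i in range(n):
        z = -width + i * h
        weight = 0.5 if i in (0, n - 1) else 1.0    # trapezoidal end-point weights
        total += weight * f(mean + std * z) * math.exp(-0.5 * z * z)
    return total * h / math.sqrt(2.0 * math.pi)


def value_constant_controls(f, x, T, controls):
    """sup over constant controls nu of E[f(x + nu * W_T)]; a lower bound for V_0."""
    return max(gaussian_expectation(f, x, nu * math.sqrt(T)) for nu in controls)
```

For $f(y) = y^2$ the expectation under a constant control $\nu$ is $x^2 + \nu^2 T$, so the largest volatility is selected, whereas for $f(y) = -y^2$ the smallest one is selected. The optimizer thus depends on the reward, which is the sublinear behaviour one expects from a supremum over measures.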

The function $\xi$ is fixed throughout Sections 2 and 3 and hence often suppressed in the notation. In view of the double dependence on $\omega$ in (2.7), the measurability of $V_t$ is not obvious. We have the following regularity result.

Proposition 2.5. Let $t \in [0,T]$ and $\xi \in \mathrm{UC}_b(\Omega)$. Then $V_t \in \mathrm{UC}_b(\Omega_t)$ and in particular $V_t$ is $\mathcal{F}_t$-measurable. More precisely,

$$|V_t(\omega) - V_t(\omega')| \le \rho(\|\omega - \omega'\|_t) \quad \text{for all } \omega, \omega' \in \Omega$$

with a modulus of continuity $\rho$ depending only on $\xi$, the Lipschitz constant $K$ and the time horizon $T$.

The first source of regularity for $V_t$ is our assumption that $\xi$ is uniformly continuous; the second one is the Lipschitz property of the SDE. Before stating the proof of the proposition, we examine the latter aspect in detail.

Lemma 2.6. Let $\psi \in \mathrm{UC}_b(\Omega^t)$. There exists a modulus of continuity $\rho_{K,T,\psi}$, depending only on $K$, $T$ and the minimal modulus of continuity of $\psi$, such that

$$\big|E^{P(t,\omega,\nu)}[\psi] - E^{P(t,\bar\omega,\nu)}[\psi]\big| \le \rho_{K,T,\psi}(\|\omega - \bar\omega\|_t)$$

for all $t \in [0,T]$, $\nu \in \mathcal{U}^t$ and $\omega, \bar\omega \in \Omega$.

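Proof. We set $E[\,\cdot\,] := E^{P^t_0}[\,\cdot\,]$ to alleviate the notation. Let $\omega, \bar\omega \in \Omega$, set $X := X(t,\omega,\nu)$ and $\bar X := X(t,\bar\omega,\nu)$, and recall that $X^t = X - X_t = X - \omega_t$ and similarly $\bar X^t = \bar X - \bar\omega_t$.

(i) We begin with a standard SDE estimate. Let $X^t = M^t + A^t$ and $\bar X^t = \bar M^t + \bar A^t$ be the semimartingale decompositions and $t \le \tau \le T$ be a stopping time such that $M^t, \bar M^t, A^t, \bar A^t$ are bounded on $[t,\tau]$. Then Itô's formula and the Lipschitz property (2.2) of $\sigma$ yield that

$$E\big[|M^t_\tau - \bar M^t_\tau|^2\big] \le E \int_t^\tau |\sigma(r, \omega \otimes_t X, \nu_r) - \sigma(r, \bar\omega \otimes_t \bar X, \nu_r)|^2\,dr$$

$$\le K^2 E \int_t^\tau \|\omega \otimes_t X - \bar\omega \otimes_t \bar X\|_r^2\,dr$$

$$\le K^2 E \int_t^\tau \big(\|\omega - \bar\omega\|_t + \|X^t - \bar X^t\|_{[t,r]}\big)^2\,dr$$

$$\le 2K^2 T \|\omega - \bar\omega\|_t^2 + 2K^2 \int_t^T E\big[\|X^t - \bar X^t\|^2_{[t, r \wedge \tau]}\big]\,dr.$$

Hence, Doob's maximal inequality implies that

$$E\big[\|M^t - \bar M^t\|^2_{[t,\tau]}\big] \le 4 E\big[|M^t_\tau - \bar M^t_\tau|^2\big] \le 8K^2 T \|\omega - \bar\omega\|_t^2 + 8K^2 \int_t^T E\big[\|X^t - \bar X^t\|^2_{[t, r \wedge \tau]}\big]\,dr.$$

Moreover, using the Lipschitz property (2.2) of $\mu$, we also have that

$$|A^t_s - \bar A^t_s| \le \int_t^s |\mu(r, \omega \otimes_t X, \nu_r) - \mu(r, \bar\omega \otimes_t \bar X, \nu_r)|\,dr \le K \int_t^s \big(\|\omega - \bar\omega\|_t + \|X^t - \bar X^t\|_{[t,r]}\big)\,dr$$

for all $t \le s \le T$ and then Jensen's inequality yields that

$$E\big[\|A^t - \bar A^t\|^2_{[t,\tau]}\big] \le 2K^2 T^2 \|\omega - \bar\omega\|_t^2 + 2K^2 T \int_t^T E\big[\|X^t - \bar X^t\|^2_{[t, r \wedge \tau]}\big]\,dr.$$

Hence, we have shown that

$$E\big[\|X^t - \bar X^t\|^2_{[t,\tau]}\big] \le C_0 \|\omega - \bar\omega\|_t^2 + C_0 \int_t^T E\big[\|X^t - \bar X^t\|^2_{[t, r \wedge \tau]}\big]\,dr,$$

where $C_0$ depends only on $K$ and $T$, and we conclude by Gronwall's lemma that

$$E\big[\|X^t - \bar X^t\|^2_{[t,\tau]}\big] \le C \|\omega - \bar\omega\|_t^2, \quad C := C_0 e^{C_0 T}.$$

By the continuity of their sample paths, there exists a localizing sequence $(\tau_n)_{n \ge 1}$ of stopping times such that $M^t, \bar M^t, A^t, \bar A^t$ are bounded on $[t,\tau_n]$ for each $n$. Therefore, monotone convergence and the previous inequality yield that

$$E\big[\|X^t - \bar X^t\|^2_{[t,T]}\big] \le C \|\omega - \bar\omega\|_t^2. \tag{2.8}$$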
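(ii) Let $\rho$ be the minimal (nondecreasing) modulus of continuity for $\psi$,

$$\rho(z) := \sup\big\{|\psi(\tilde\omega) - \psi(\tilde\omega')| : \tilde\omega, \tilde\omega' \in \Omega^t,\ \|\tilde\omega - \tilde\omega'\|_{[t,T]} \le z\big\},$$

and let $\hat\rho$ be the concave hull of $\rho$. Then $\hat\rho$ is a bounded continuous function satisfying $\hat\rho(0) = 0$ and $\hat\rho \ge \rho$. Let $P := P(t,\omega,\nu)$ and $\bar P := P(t,\bar\omega,\nu)$, then $P$ and $\bar P$ are the distributions of $X^t$ and $\bar X^t$, respectively; therefore,

$$\big|E^P[\psi] - E^{\bar P}[\psi]\big| = \big|E[\psi(X^t) - \psi(\bar X^t)]\big| \le E\big[\hat\rho(\|X^t - \bar X^t\|_{[t,T]})\big]. \tag{2.9}$$

Moreover, Jensen's inequality and (2.8) yield that

$$E\big[\hat\rho(\|X^t - \bar X^t\|_{[t,T]})\big] \le \hat\rho\big(E[\|X^t - \bar X^t\|_{[t,T]}]\big) \le \hat\rho\big(E[\|X^t - \bar X^t\|^2_{[t,T]}]^{1/2}\big) \le \hat\rho\big(\sqrt{C}\,\|\omega - \bar\omega\|_t\big).$$

In view of (2.9), we have $|E^P[\psi] - E^{\bar P}[\psi]| \le \hat\rho(\sqrt{C}\,\|\omega - \bar\omega\|_t)$; i.e., the result holds for $\rho_{K,T,\psi}(z) := \hat\rho(\sqrt{C} z)$. □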

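After these preparations, we can prove the continuity of $V_t$.

Proof of Proposition 2.5. To disentangle the double dependence on $\omega$ in (2.7), we first consider the function

$$(\eta,\omega) \mapsto E^{P(t,\eta,\nu)}[\xi^{t,\omega}], \quad (\eta,\omega) \in \Omega \times \Omega.$$

Since $\xi \in \mathrm{UC}_b(\Omega)$, there exists a modulus of continuity $\rho^{(\xi)}$ for $\xi$; i.e.,

$$|\xi(\omega) - \xi(\omega')| \le \rho^{(\xi)}(\|\omega - \omega'\|_T), \quad \omega, \omega' \in \Omega.$$

Therefore, we have for all $\tilde\omega \in \Omega^t$ that

$$|\xi^{t,\omega}(\tilde\omega) - \xi^{t,\omega'}(\tilde\omega)| = |\xi(\omega \otimes_t \tilde\omega) - \xi(\omega' \otimes_t \tilde\omega)| \le \rho^{(\xi)}(\|\omega \otimes_t \tilde\omega - \omega' \otimes_t \tilde\omega\|_T) = \rho^{(\xi)}(\|\omega - \omega'\|_t). \tag{2.10}$$

For fixed $\eta \in \Omega$, it follows that

$$\big|E^{P(t,\eta,\nu)}[\xi^{t,\omega}] - E^{P(t,\eta,\nu)}[\xi^{t,\omega'}]\big| \le \rho^{(\xi)}(\|\omega - \omega'\|_t). \tag{2.11}$$

Fix $\omega \in \Omega$ and let $\psi_\omega := \xi^{t,\omega}$. Then $\rho^{(\xi)}$ yields a modulus of continuity for $\psi_\omega$; in particular, this modulus of continuity is uniform in $\omega$. Thus Lemma 2.6 implies that the mapping $\eta \mapsto E^{P(t,\eta,\nu)}[\psi_\omega]$ admits a modulus of continuity $\rho_{T,K,\xi}$ depending only on $T$, $K$, $\xi$. In view of (2.11), we conclude that

$$\big|E^{P(t,\eta,\nu)}[\xi^{t,\omega}] - E^{P(t,\eta',\nu)}[\xi^{t,\omega'}]\big| \le \rho_{T,K,\xi}(\|\eta - \eta'\|_t) + \rho^{(\xi)}(\|\omega - \omega'\|_t)$$

for all $\eta, \eta', \omega, \omega' \in \Omega$ and in particular that

$$\big|E^{P(t,\omega,\nu)}[\xi^{t,\omega}] - E^{P(t,\omega',\nu)}[\xi^{t,\omega'}]\big| \le \rho(\|\omega - \omega'\|_t), \quad \rho := \rho_{T,K,\xi} + \rho^{(\xi)}, \tag{2.12}$$

for all $\omega, \omega' \in \Omega$. Passing to the supremum over $\nu \in \mathcal{U}^t$, we obtain that $|V_t(\omega) - V_t(\omega')| \le \rho(\|\omega - \omega'\|_t)$ for all $\omega, \omega' \in \Omega$, which was the claim. □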

3 Pathwise Dynamic Programming 

In this section, we provide a pathwise dynamic programming principle which is fundamental for the subsequent sections. As we are working in the weak formulation (1.4), the arguments used here are similar to, e.g., [26], while [22] gives a related construction in the strong formulation (i.e., working only under $P_0$).

We assume in this section that the following conditional version of Assumption 2.1 holds true; however, we shall see later (Lemma 4.4) that this extended assumption holds automatically outside certain nullsets.

Assumption 3.1. Throughout Section 3, we assume that

$$\overline{\mathbb{F}^{X^t}}^{P^t_0} \supseteq \mathbb{F}^t \quad \text{for } X := X(t,\eta,\nu), \tag{3.1}$$

for all $(t,\eta,\nu) \in [0,T] \times \Omega \times \mathcal{U}^t$.

The main result of this section is the following dynamic programming principle. We shall also provide more general, quasi-sure versions of this result later (the final form being Theorem 5.2).

Theorem 3.2. Let $0 \le s \le t \le T$, $\xi \in \mathrm{UC}_b(\Omega)$ and set $V_r(\cdot) = V_r(\xi;\cdot)$. Then

$$V_s(\omega) = \sup_{\nu \in \mathcal{U}^s} E^{P(s,\omega,\nu)}\big[(V_t)^{s,\omega}\big] \quad \text{for all } \omega \in \Omega. \tag{3.2}$$

We remark that in view of Proposition 2.5, we may see $\xi \mapsto V_r(\xi;\cdot)$ as a mapping $\mathrm{UC}_b(\Omega) \to \mathrm{UC}_b(\Omega)$ and recast (3.2) as the semigroup property

$$V_s = V_s \circ V_t \quad \text{on } \mathrm{UC}_b(\Omega) \text{ for all } 0 \le s \le t \le T. \tag{3.3}$$
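The content of (3.2) can be checked by hand in a toy discrete-time analogue, which is not the setting of the paper: the state moves by $X_{k+1} = X_k + \nu_k \varepsilon_k$ with fair coin flips $\varepsilon_k = \pm 1$ and a volatility control taking two values. The sketch below compares a backward recursion on the path space of $X$ with a brute-force supremum over all controls adapted to the noise. The constants `STEPS`, `VOLS`, `SIGNS` and the reward are hypothetical choices made for the example.

```python
from itertools import product

STEPS = 3
VOLS = (0.5, 1.0)       # control set U (hypothetical values)
SIGNS = (-1.0, 1.0)     # equally likely noise outcomes


def reward(path):
    """A path-dependent reward: running maximum minus terminal value."""
    return max(path) - path[-1]


def dp_value(path):
    """Backward recursion V_k(path) = max_u E[V_{k+1}(path extended by u * noise)]."""
    if len(path) == STEPS + 1:
        return reward(path)
    return max(
        sum(dp_value(path + (path[-1] + u * e,)) for e in SIGNS) / len(SIGNS)
        for u in VOLS
    )


def brute_force_value(x0):
    """Supremum of E[reward(X)] over all controls adapted to the noise history."""
    histories = [h for k in range(STEPS) for h in product(SIGNS, repeat=k)]
    best = float("-inf")
    for choice in product(VOLS, repeat=len(histories)):
        control = dict(zip(histories, choice))      # noise history -> control value
        total = 0.0
        for noise in product(SIGNS, repeat=STEPS):
            path = [x0]
            for k in range(STEPS):
                path.append(path[-1] + control[noise[:k]] * noise[k])
            total += reward(path)
        best = max(best, total / len(SIGNS) ** STEPS)
    return best
```

Here `dp_value` is a single function of the state path, in the spirit of the value process $V$, while `brute_force_value` optimizes over the family of controls as in (1.2). In this finite model both give the same number.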

Some auxiliary results are needed for the proof of Theorem 3.2, which is stated at the end of this section. We start with the (well known) observation that conditioning the solution of an SDE yields the solution of a suitably conditioned SDE.

Lemma 3.3. Let $0 \le s \le t \le T$, $\nu \in \mathcal{U}^s$ and $\bar\omega \in \Omega$. If $\bar X := X(s,\bar\omega,\nu)$, then

$$\bar X^{t,\omega} = X\big(t, \bar\omega \otimes_s \bar X(\omega), \nu^{t,\omega}\big) \quad P^t_0\text{-a.s.}$$

for all $\omega \in \Omega^s$.

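Proof. Let $\omega \in \Omega^s$. Using the definition and the flow property of $\bar X$, we have

$$\bar X_r = \bar\omega_s + \int_s^r \mu^{s,\bar\omega}(u, \bar X, \nu_u)\,du + \int_s^r \sigma^{s,\bar\omega}(u, \bar X, \nu_u)\,dB^s_u$$

$$= \bar X_t + \int_t^r \mu(u, \bar\omega \otimes_s \bar X, \nu_u)\,du + \int_t^r \sigma(u, \bar\omega \otimes_s \bar X, \nu_u)\,dB^s_u \quad P^s_0\text{-a.s.}$$

for all $r \in [t,T]$. Hence, using that $(P^s_0)^{t,\omega} = P^t_0$ by the $P^s_0$-independence of the increments of $B^s$,

$$\bar X^{t,\omega}_r = \bar X^{t,\omega}_t + \int_t^r \mu(u, \bar\omega \otimes_s \bar X^{t,\omega}, \nu^{t,\omega}_u)\,du + \int_t^r \sigma(u, \bar\omega \otimes_s \bar X^{t,\omega}, \nu^{t,\omega}_u)\,dB^t_u \quad P^t_0\text{-a.s.} \tag{3.4}$$

Since $\bar X$ is adapted, we have $\bar X^{t,\omega}(\cdot) = \bar X(\omega \otimes_t \cdot) = \bar X(\omega)$ on $[s,t]$ and in particular

$$\bar\omega \otimes_s \bar X^{t,\omega} = \bar\omega \otimes_s \bar X(\omega) \otimes_t \bar X^{t,\omega} = \eta \otimes_t \bar X^{t,\omega}, \quad \text{for } \eta := \bar\omega \otimes_s \bar X(\omega).$$

Therefore, recalling that $\bar X_s = \bar\omega_s$, so that $\bar X^{t,\omega}_t = \bar X_t(\omega) = \eta_t$, (3.4) can be stated as

$$\bar X^{t,\omega}_r = \eta_t + \int_t^r \mu^{t,\eta}(u, \bar X^{t,\omega}, \nu^{t,\omega}_u)\,du + \int_t^r \sigma^{t,\eta}(u, \bar X^{t,\omega}, \nu^{t,\omega}_u)\,dB^t_u \quad P^t_0\text{-a.s.};$$

i.e., $\bar X^{t,\omega}$ solves the SDE (2.6) for the parameters $(t, \eta, \nu^{t,\omega})$. Now the result follows by the uniqueness of the solution to this SDE. □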

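Given $t \in [0,T]$ and $\omega \in \Omega$, we define

$$\mathcal{P}(t,\omega) = \{P(t,\omega,\nu) : \nu \in \mathcal{U}^t\}. \tag{3.5}$$

These sets have the following invariance property.

Lemma 3.4. Let $0 \le s \le t \le T$ and $\bar\omega \in \Omega$. If $P \in \mathcal{P}(s,\bar\omega)$, then

$$P^{t,\omega} \in \mathcal{P}(t, \bar\omega \otimes_s \omega) \quad \text{for } P\text{-a.e. } \omega \in \Omega^s.$$

Proof. Since $P \in \mathcal{P}(s,\bar\omega)$, we have $P = P(s,\bar\omega,\nu)$ for some $\nu \in \mathcal{U}^s$; i.e., setting $X := X(s,\bar\omega,\nu)$, $P$ is the distribution of

$$X^s = \int_s^\cdot \mu(r, \bar\omega \otimes_s X, \nu_r)\,dr + \int_s^\cdot \sigma(r, \bar\omega \otimes_s X, \nu_r)\,dB^s_r \quad \text{under } P^s_0.$$

We set $\hat\mu_r := \mu(r, \bar\omega \otimes_s X, \nu_r)$ and $\hat\sigma_r := \sigma(r, \bar\omega \otimes_s X, \nu_r)$ and see the above as the integral $\int_s^\cdot \hat\mu_r\,dr + \int_s^\cdot \hat\sigma_r\,dB^s_r$ rather than an SDE. As in [26, Lemma 2.2], the nondegeneracy assumption (3.1) implies the existence of a progressively measurable transformation $\beta_\nu : \Omega^s \to \Omega^s$ (depending on $s$, $\bar\omega$, $\nu$) such that

$$\beta_\nu(X^s) = B^s \quad P^s_0\text{-a.s.} \tag{3.6}$$

Furthermore, a rather tedious calculation as in the proof of [26, Lemma 4.1] shows that

$$P^{t,\omega} = P^t_0 \circ \Big(\int_t^\cdot \hat\mu_r^{t,\beta_\nu(\omega)}\,dr + \int_t^\cdot \hat\sigma_r^{t,\beta_\nu(\omega)}\,dB^t_r\Big)^{-1} \quad \text{for } P\text{-a.e. } \omega \in \Omega^s.$$

Note that, abbreviating $\check\omega := \beta_\nu(\omega)$, we have

$$\hat\mu_r^{t,\check\omega} = \mu\big(r, \bar\omega \otimes_s X^{t,\check\omega}, \nu_r^{t,\check\omega}\big)$$

and similarly for $\hat\sigma^{t,\check\omega}$. Hence, we deduce by Lemma 3.3 that

$$P^{t,\omega} = P\big(t, \bar\omega \otimes_s X(\check\omega), \nu^{t,\check\omega}\big) \quad \text{for } P\text{-a.e. } \omega \in \Omega^s.$$

In view of (3.6), we have

$$X^s(\beta_\nu(B^s)) = B^s \quad P\text{-a.s.}; \tag{3.7}$$

i.e., $X(\check\omega)^s = X^s(\check\omega) = \omega$ for $P$-a.e. $\omega \in \Omega^s$, and we conclude that

$$P^{t,\omega} = P\big(t, \bar\omega \otimes_s \omega, \nu^{t,\check\omega}\big) \quad \text{for } P\text{-a.e. } \omega \in \Omega^s. \tag{3.8}$$

In particular, $P^{t,\omega} \in \mathcal{P}(t, \bar\omega \otimes_s \omega)$. □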

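Lemma 3.5 (Pasting). Let $0 \le s \le t \le T$, $\bar\omega \in \Omega$, $\nu \in \mathcal{U}^s$ and set $P := P(s,\bar\omega,\nu)$, $X := X(s,\bar\omega,\nu)$. Let $(E^i)_{0 \le i \le N}$ be a finite $\mathcal{F}^s_t$-measurable partition of $\Omega^s$, $\nu^i \in \mathcal{U}^t$ for $1 \le i \le N$ and define $\bar\nu$ by

$$\bar\nu(\omega) := 1_{[s,t)}\,\nu(\omega) + 1_{[t,T]} \Big[\nu(\omega)\,1_{E^0}(X(\omega)^s) + \sum_{i=1}^N \nu^i(\omega^t)\,1_{E^i}(X(\omega)^s)\Big].$$

Then $\bar P := P(s,\bar\omega,\bar\nu)$ satisfies $\bar P = P$ on $\mathcal{F}^s_t$ and

$$\bar P^{t,\omega} = P(t, \bar\omega \otimes_s \omega, \nu^i) \quad \text{for } P\text{-a.e. } \omega \in E^i, \quad 1 \le i \le N.$$

Proof. As $\bar\nu = \nu$ on $[s,t)$, we have $X(s,\bar\omega,\bar\nu) = X$ on $[s,t]$ and in particular $\bar P = P$ on $\mathcal{F}^s_t$. Let $1 \le i \le N$; we show that $\bar P^{t,\omega} = P(t, \bar\omega \otimes_s \omega, \nu^i)$ for $P$-a.e. $\omega \in E^i$. Recall from (3.8) that

$$\bar P^{t,\omega} = P\big(t, \bar\omega \otimes_s \omega, \bar\nu^{t,\beta_{\bar\nu}(\omega)}\big) \quad \text{for } \bar P\text{-a.e. } \omega \in \Omega^s,$$

where $\beta_{\bar\nu}$ is defined as in (3.6). Since both sides of this equality depend only on the restriction of $\omega$ to $[s,t]$ and $X(s,\bar\omega,\bar\nu) = X$ on $[s,t]$, we also have that

$$\bar P^{t,\omega} = P\big(t, \bar\omega \otimes_s \omega, \bar\nu^{t,\check\omega}\big) \quad \text{for } P\text{-a.e. } \omega \in \Omega^s, \tag{3.9}$$

where $\check\omega = \beta_\nu(\omega)$ is defined as below (3.6). Note that by (3.7), $\omega \in E^i$ implies $X(\check\omega)^s \in E^i$ under $P$. (More precisely, if $A \subseteq \Omega^s$ is a set such that $A \subseteq E^i$ $P$-a.s., then $\{X(\check\omega)^s : \omega \in A\} \subseteq E^i$ $P$-a.s.) In fact, since $X$ is adapted and $E^i \in \mathcal{F}^s_t$, we even have that

$$X(\check\omega \otimes_t \tilde\omega)^s \in E^i \quad \text{for all } \tilde\omega \in \Omega^t, \quad \text{for } P\text{-a.e. } \omega \in E^i.$$

By the definition of $\bar\nu$, we conclude that

$$\bar\nu^{t,\check\omega}(\tilde\omega) = \bar\nu(\check\omega \otimes_t \tilde\omega) = \nu^i\big((\check\omega \otimes_t \tilde\omega)^t\big) = \nu^i(\tilde\omega), \quad \tilde\omega \in \Omega^t, \quad \text{for } P\text{-a.e. } \omega \in E^i.$$

In view of (3.9), this yields the claim. □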
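The structure of a pasted control can be mimicked in discrete time. The sketch below is only an analogy under assumptions that are not in the paper: a control is a function of the noise history (a tuple of past noise outcomes), the switching time is a step index `t`, and each cell of the partition is a finite set of state paths up to step `t`. The helper `state_upto_t`, which maps a noise history of length `t` to the state path generated under the original control, is hypothetical and must be supplied by the caller.

```python
def paste(nu, t, cells, later_controls, state_upto_t):
    """Pasted control: nu before step t; from step t on, later_controls[i] on cells[i].

    nu and each later control map a noise history (tuple) to a control value.
    On paths that belong to no cell (the analogue of E^0) the original nu is kept.
    """
    def nu_bar(history):
        if len(history) < t:
            return nu(history)                  # unchanged before the switching time
        state = state_upto_t(history[:t])       # state path generated under nu up to t
        for cell, nu_i in zip(cells, later_controls):
            if state in cell:
                return nu_i(history[t:])        # nu^i only sees the noise after time t
        return nu(history)
    return nu_bar
```

Because the pasted control agrees with `nu` before step `t`, the state path up to `t` is the same under both controls. This is the discrete counterpart of the statement that the pasted measure and the original one coincide on the information available at time $t$.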

Remark 3.6. In [26] and [17], it was possible to use a pasting of measures as follows: in the notation of Lemma 3.5, it was possible to specify measures $P^i$ on $\Omega^t$, corresponding to certain admissible controls, and use the pasting $\bar P(A) := P(A \cap E^0) + \sum_{i=1}^N E^P[P^i(A^{t,\omega}) 1_{E^i}(\omega)]$ to obtain a measure $\bar P$ on $\Omega^s$ which again corresponded to some admissible control and satisfied

$$\bar P^{t,\omega} = P^i \quad \text{for } P\text{-a.e. } \omega \in E^i, \tag{3.10}$$

which was then used in the proof of the dynamic programming principle.

This is not possible in our SDE-driven setting. Indeed, suppose that $\bar P$ is of the form $P(s,\bar\omega,\bar\nu)$ for some $\bar\omega$ and $\bar\nu$, then we see from (3.8) that, when $\sigma$ is general, $\bar P^{t,\omega}$ will depend explicitly on $\omega$, which contradicts (3.10). Therefore, the subsequent proof uses an argument where (3.10) holds only at one specific $\omega \in E^i$; on the rest of $E^i$, we confine ourselves to controlling the error.

We can now show the dynamic programming principle. Apart from the difference remarked above, the basic pattern of the proof is the same as in [26, Proposition 4.7].

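Proof of Theorem 3.2. Using the notation (3.5), our claim (3.2) can be stated as

$$\sup_{P \in \mathcal{P}(s,\bar\omega)} E^P[\xi^{s,\bar\omega}] = \sup_{P \in \mathcal{P}(s,\bar\omega)} E^P\big[(V_t)^{s,\bar\omega}\big] \quad \text{for all } \bar\omega \in \Omega. \tag{3.11}$$

(i) We first show the inequality "$\le$" in (3.11). Fix $\bar\omega \in \Omega$ and $P \in \mathcal{P}(s,\bar\omega)$. Lemma 3.4 shows that $P^{t,\omega} \in \mathcal{P}(t, \bar\omega \otimes_s \omega)$ for $P$-a.e. $\omega \in \Omega^s$ and hence that

$$E^{P^{t,\omega}}\big[(\xi^{s,\bar\omega})^{t,\omega}\big] = E^{P^{t,\omega}}\big[\xi^{t,\bar\omega \otimes_s \omega}\big] \le \sup_{P' \in \mathcal{P}(t, \bar\omega \otimes_s \omega)} E^{P'}\big[\xi^{t,\bar\omega \otimes_s \omega}\big] = V_t(\bar\omega \otimes_s \omega) = V_t^{s,\bar\omega}(\omega) \quad \text{for } P\text{-a.e. } \omega \in \Omega^s.$$

Since $V_t$ is measurable by Proposition 2.5, we can take $P(d\omega)$-expectations on both sides to obtain that

$$E^P[\xi^{s,\bar\omega}] = E^P\Big[E^{P^{t,\omega}}\big[(\xi^{s,\bar\omega})^{t,\omega}\big]\Big] \le E^P\big[V_t^{s,\bar\omega}\big].$$

We take the supremum over $P \in \mathcal{P}(s,\bar\omega)$ on both sides and obtain the claim.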

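(ii) We now show the inequality "$\ge$" in (3.11). Fix $\bar\omega \in \Omega$, $\nu \in \mathcal{U}^s$ and let $P = P(s,\bar\omega,\nu)$. We fix $\varepsilon > 0$ and construct a countable cover of the state space as follows.

Let $\hat\omega \in \Omega^s$. By the definition of $V_t(\bar\omega \otimes_s \hat\omega)$, there exists $\nu^{(\hat\omega)} \in \mathcal{U}^t$ such that $P^{(\hat\omega)} := P(t, \bar\omega \otimes_s \hat\omega, \nu^{(\hat\omega)})$ satisfies

$$V_t(\bar\omega \otimes_s \hat\omega) \le E^{P^{(\hat\omega)}}\big[\xi^{t,\bar\omega \otimes_s \hat\omega}\big] + \varepsilon. \tag{3.12}$$

Let $B(\varepsilon,\hat\omega) \subseteq \Omega^s$ denote the open $\|\cdot\|_{[s,t]}$-ball of radius $\varepsilon$ around $\hat\omega$. Since $(\Omega^s, \|\cdot\|_{[s,t]})$ is a separable (quasi-)metric space and therefore Lindelöf, there exists a sequence $(\hat\omega^i)_{i \ge 1}$ in $\Omega^s$ such that the balls $B^i := B(\varepsilon, \hat\omega^i)$ form a cover of $\Omega^s$. As an $\|\cdot\|_{[s,t]}$-open set, each $B^i$ is $\mathcal{F}^s_t$-measurable and hence

$$E^1 := B^1, \quad E^{i+1} := B^{i+1} \setminus (E^1 \cup \dots \cup E^i), \quad i \ge 1$$

defines a partition $(E^i)_{i \ge 1}$ of $\Omega^s$. Replacing $E^i$ by

$$(E^i \cup \{\hat\omega^i\}) \setminus \{\hat\omega^j : j \ge 1,\ j \ne i\}$$

if necessary, we may assume that $\hat\omega^i \in E^i$ for $i \ge 1$. We set $\nu^i := \nu^{(\hat\omega^i)}$ and $P^i := P(t, \bar\omega \otimes_s \hat\omega^i, \nu^i)$.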
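Next, we paste the controls $\nu^i$. Fix $N \in \mathbb{N}$ and let $A_N := E^1 \cup \dots \cup E^N$, then $\{A_N^c, E^1, \dots, E^N\}$ is a finite partition of $\Omega^s$. Let $X := X(s,\bar\omega,\nu)$, define

$$\bar\nu(\omega) := 1_{[s,t)}\,\nu(\omega) + 1_{[t,T]} \Big[\nu(\omega)\,1_{A_N^c}(X(\omega)^s) + \sum_{i=1}^N \nu^i(\omega^t)\,1_{E^i}(X(\omega)^s)\Big]$$

and let $\bar P := P(s,\bar\omega,\bar\nu)$. Then, by Lemma 3.5, we have $\bar P = P$ on $\mathcal{F}^s_t$ and $\bar P^{t,\omega} = P(t, \bar\omega \otimes_s \omega, \nu^i)$ for all $\omega \in \tilde E^i$, for some subset $\tilde E^i \subseteq E^i$ of full measure $P$. Let us assume for the moment that

$$\hat\omega^i \in \tilde E^i \quad \text{for } 1 \le i \le N, \tag{3.13}$$

then we may conclude that

$$\bar P^{t,\hat\omega^i} = P^i \quad \text{for } 1 \le i \le N. \tag{3.14}$$

Recall from Proposition 2.5 that $V_t$ admits a modulus of continuity $\rho^{(V_t)}$. Moreover, we obtain similarly as in (2.10) that there exists a modulus of continuity $\rho^{(\xi)}$ such that

$$\big|\xi^{t,\bar\omega \otimes_s \omega} - \xi^{t,\bar\omega \otimes_s \omega'}\big| \le \rho^{(\xi)}(\|\omega - \omega'\|_{[s,t]}).$$

Let $\omega \in E^i$ for some $1 \le i \le N$, then $\|\omega - \hat\omega^i\|_{[s,t]} < \varepsilon$. Together with (3.12) and (3.14), we obtain that

$$V_t^{s,\bar\omega}(\omega) \le V_t^{s,\bar\omega}(\hat\omega^i) + \rho^{(V_t)}(\varepsilon) \le E^{P^i}\big[\xi^{t,\bar\omega \otimes_s \hat\omega^i}\big] + \varepsilon + \rho^{(V_t)}(\varepsilon) = E^{\bar P^{t,\hat\omega^i}}\big[\xi^{t,\bar\omega \otimes_s \hat\omega^i}\big] + \varepsilon + \rho^{(V_t)}(\varepsilon). \tag{3.15}$$

Recall from (2.12) that the mapping

$$\omega' \mapsto E^{P(t, \bar\omega \otimes_s \omega', \nu^i)}\big[\xi^{t,\bar\omega \otimes_s \omega'}\big]$$

is uniformly continuous with a modulus $\tilde\rho$ independent of $i$ and $N$. Since $\omega \in E^i$, it follows that

$$\Big|E^{\bar P^{t,\hat\omega^i}}\big[\xi^{t,\bar\omega \otimes_s \hat\omega^i}\big] - E^{\bar P^{t,\omega}}\big[\xi^{t,\bar\omega \otimes_s \omega}\big]\Big| \le \tilde\rho(\varepsilon) \quad \text{for } P\text{-a.e. } \omega \in E^i. \tag{3.16}$$

Setting $\rho(\varepsilon) := \tilde\rho(\varepsilon) + \varepsilon + \rho^{(V_t)}(\varepsilon)$ and noting that

$$E^{\bar P^{t,\omega}}\big[\xi^{t,\bar\omega \otimes_s \omega}\big] = E^{\bar P^{t,\omega}}\big[(\xi^{s,\bar\omega})^{t,\omega}\big] = E^{\bar P}\big[\xi^{s,\bar\omega} \,\big|\, \mathcal{F}^s_t\big](\omega),$$

the inequalities (3.15) and (3.16) imply that

$$V_t^{s,\bar\omega}(\omega) \le E^{\bar P}\big[\xi^{s,\bar\omega} \,\big|\, \mathcal{F}^s_t\big](\omega) + \rho(\varepsilon) \tag{3.17}$$

for $P$-a.e. (and thus $\bar P$-a.e.) $\omega \in E^i$. This holds for all $1 \le i \le N$. As $P = \bar P$ on $\mathcal{F}^s_t$, taking $P$-expectations yields

$$E^P\big[V_t^{s,\bar\omega} 1_{A_N}\big] \le E^{\bar P_N}\big[\xi^{s,\bar\omega} 1_{A_N}\big] + \rho(\varepsilon), \tag{3.18}$$

where we write $\bar P_N = \bar P$ to recall the dependence on $N$. Since $A_N \uparrow \Omega^s$, we have $\bar P_N(A_N^c) = P(A_N^c) \to 0$ as $N \to \infty$. In view of

$$E^{\bar P_N}\big[\xi^{s,\bar\omega} 1_{A_N}\big] = E^{\bar P_N}\big[\xi^{s,\bar\omega}\big] - E^{\bar P_N}\big[\xi^{s,\bar\omega} 1_{A_N^c}\big] \le E^{\bar P_N}\big[\xi^{s,\bar\omega}\big] + \|\xi\|_\infty \bar P_N(A_N^c),$$

we conclude from (3.18) that

$$E^P\big[V_t^{s,\bar\omega}\big] \le \limsup_{N \to \infty} E^{\bar P_N}\big[\xi^{s,\bar\omega}\big] + \rho(\varepsilon) \le \sup_{P' \in \mathcal{P}(s,\bar\omega)} E^{P'}\big[\xi^{s,\bar\omega}\big] + \rho(\varepsilon).$$

Since $P \in \mathcal{P}(s,\bar\omega)$ was arbitrary, letting $\varepsilon \to 0$ completes the proof of (3.11).

It remains to argue that our assumption (3.13) does not entail a loss of generality. Indeed, assume that $\hat\omega^i \notin \tilde E^i$ for some $i$. Then there are two possible cases. The case $P(E^i) = 0$ is easily seen to be harmless; recall that the measure $P$ was fixed throughout the proof. In the case $P(E^i) > 0$, we also have $P(\tilde E^i) > 0$ and in particular $\tilde E^i \ne \emptyset$. Thus we can replace $\hat\omega^i$ by an arbitrary element of $\tilde E^i$ (which can be chosen independently of $N$). Using the continuity of the value function (Proposition 2.5) and of the reward function (2.12), we see that the above arguments still apply if we add an additional modulus of continuity in (3.15). □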



4 Extension of the Value Function 

In this section, we extend the value function $\xi \mapsto V_t(\xi;\cdot)$ to an $L^1$-type space of random variables $\xi$, in the spirit of, e.g., [8]. While the construction of $V_t$ in the previous section required a precise analysis "$\omega$ by $\omega$", we can now move towards a more probabilistic presentation. In particular, we shall often write $V_t(\xi)$ for the random variable $\omega \mapsto V_t(\xi;\omega)$.

For reasons explained in Remark 4.2 below, we fix from now on an initial condition $x \in \mathbb{R}^d$ and let

$$\mathcal{P}_x := \{P(0,x,\nu) : \nu \in \mathcal{U}\}$$

be the corresponding set of measures at time $s = 0$. Given a random variable $\psi$ on $\Omega$, we write $\psi^x$ as a shorthand for $\psi^{0,x} \equiv \psi(x \otimes_0 \cdot)$. We also write $V^x_t(\xi)$ for $(V_t(\xi))^x$.

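Given $p \in [1,\infty)$, we define $L^p_{\mathcal{P}_x}$ to be the space of $\mathcal{F}_T$-measurable random variables $X$ satisfying

$$\|X\|_{L^p_{\mathcal{P}_x}} := \sup_{P \in \mathcal{P}_x} \|X\|_{L^p(P)} < \infty,$$

where $\|X\|^p_{L^p(P)} := E^P[|X|^p]$. More precisely, we identify functions which are equal $\mathcal{P}_x$-quasi-surely, so that $L^p_{\mathcal{P}_x}$ becomes a Banach space. (Two functions are equal $\mathcal{P}_x$-quasi-surely, $\mathcal{P}_x$-q.s. for short, if they are equal $P$-a.s. for all $P \in \mathcal{P}_x$.) Furthermore, given $t \in [0,T]$,

$\mathbb{L}^p_{\mathcal{P}_x}(\mathcal{F}_t)$ is defined as the $\|\cdot\|_{L^p_{\mathcal{P}_x}}$-closure of $\mathrm{UC}_b(\Omega_t) \subseteq L^p_{\mathcal{P}_x}$.

Since any $L^p_{\mathcal{P}_x}$-convergent sequence has a $\mathcal{P}_x$-q.s. convergent subsequence, any element of $\mathbb{L}^p_{\mathcal{P}_x}(\mathcal{F}_t)$ has an $\mathcal{F}_t$-measurable representative. For brevity, we shall often write $\mathbb{L}^p_{\mathcal{P}_x}$ for $\mathbb{L}^p_{\mathcal{P}_x}(\mathcal{F}_T)$.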
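On a finite sample space the norm and the quasi-sure identification above reduce to elementary computations. The sketch below is purely illustrative: the family of measures is a finite list of probability vectors on a common finite sample space, which is an assumption of the example and far from the setting of the paper, and the function names are hypothetical.

```python
def lp_norm(values, probs, p):
    """||X||_{L^p(P)} for a random variable on a finite sample space."""
    return sum(q * abs(v) ** p for v, q in zip(values, probs)) ** (1.0 / p)


def quasi_sure_norm(values, family, p):
    """sup over the measures P in `family` of ||X||_{L^p(P)}."""
    return max(lp_norm(values, probs, p) for probs in family)


def equal_quasi_surely(x, y, family):
    """X = Y P-a.s. for every P in the family.

    The two random variables may differ only at sample points charged by no P.
    """
    return all(
        a == b or all(probs[i] == 0.0 for probs in family)
        for i, (a, b) in enumerate(zip(x, y))
    )
```

With mutually singular measures, for example `[1, 0, 0]` and `[0, 0.5, 0.5]`, no single reference measure sees all sample points that matter, which is why the identification has to be made measure by measure.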
Remark 4.1. The space $\mathbb{L}^p_{\mathcal{P}_x}$ can be described as follows. We say that $\xi \in L^p_{\mathcal{P}_x}$ is $\mathcal{P}_x$-quasi uniformly continuous if $\xi$ has a representative $\xi'$ with the property that for all $\varepsilon > 0$ there exists an open set $G \subseteq \Omega$ such that $P(G) < \varepsilon$ for all $P \in \mathcal{P}_x$ and such that the restriction $\xi'|_{\Omega \setminus G}$ is uniformly continuous. Then $\mathbb{L}^p_{\mathcal{P}_x}$ consists of all $\xi \in L^p_{\mathcal{P}_x}$ such that $\xi$ is $\mathcal{P}_x$-quasi uniformly continuous and $\lim_{n \to \infty} \|\xi 1_{\{|\xi| \ge n\}}\|_{L^p_{\mathcal{P}_x}} = 0$. Moreover, if $\mathcal{P}_x$ is weakly relatively compact, then $\mathbb{L}^p_{\mathcal{P}_x}$ contains all bounded continuous functions on $\Omega$.

The proof is the same as in [17, Proposition 5.2], which, in turn, followed an argument of [7].

Before extending the value function to $\mathbb{L}^p_{\mathcal{P}_x}$, let us explain why we are working under a fixed initial condition $x \in \mathbb{R}^d$.

Remark 4.2. There is no fundamental obstruction to writing the theory without fixing the initial condition $x$; in fact, most of the results would be more elegant if stated using $\bar{\mathcal{P}}$ instead of $\mathcal{P}_x$, where $\bar{\mathcal{P}}$ is the set of all distributions of the form (2.4), with arbitrary initial condition. However, the set $\bar{\mathcal{P}}$ is very large and therefore the corresponding space $\mathbb{L}^p_{\bar{\mathcal{P}}}$ is very small, which is undesirable for the domain of our extended value function. As an illustration, consider a random variable of the form $\xi(\omega) := f(\omega_0)$ on $\Omega$, where $f : \mathbb{R}^d \to \mathbb{R}$ is a measurable function. Then

$$\|\xi\|_{L^p_{\bar{\mathcal{P}}}} = \sup_{x \in \mathbb{R}^d} |f(x)|;$$

i.e., $\xi$ is in $L^p_{\bar{\mathcal{P}}}$ only when $f$ is uniformly bounded. As a second issue in the same vein, it follows from the Arzelà-Ascoli theorem that the set $\bar{\mathcal{P}}$ is never weakly relatively compact. The latter property, which is satisfied by $\mathcal{P}_x$ for example when $\mu$ and $\sigma$ are bounded, is sometimes useful in the context of quasi-sure analysis.

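Lemma 4.3. Let $p \in [1,\infty)$. The mapping $V^x_t$ on $\mathrm{UC}_b(\Omega)$ is 1-Lipschitz,

\[ \|V^x_t(\xi) - V^x_t(\psi)\|_{L^p_{\mathcal{P}_x}} \le \|\xi - \psi\|_{L^p_{\mathcal{P}_x}}. \]

As a consequence, $V^x_t$ uniquely extends to a Lipschitz-continuous mapping

\[ V^x_t : \mathbb{L}^p_{\mathcal{P}_x} \to \mathbb{L}^p_{\mathcal{P}_x}(\mathcal{F}_t). \]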
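Proof. The argument is standard and included only for completeness. Note
that $|\xi - \psi|^p$ is again in $\mathrm{UC}_b(\Omega)$. The definition of $V^x_t$ and Jensen's inequality
imply that $|V^x_t(\xi) - V^x_t(\psi)|^p \le V^x_t(|\xi - \psi|)^p \le V^x_t(|\xi - \psi|^p)$. Therefore,

\[ \|V^x_t(\xi) - V^x_t(\psi)\|_{L^p_{\mathcal{P}_x}} \le \sup_{P \in \mathcal{P}_x} E^P\bigl[V^x_t(|\xi - \psi|^p)\bigr]^{1/p} = \sup_{P \in \mathcal{P}_x} E^P\bigl[|\xi - \psi|^p\bigr]^{1/p}, \]

where the equality is due to (3.2) applied with $s = 0$. (For the case $s = 0$, the
additional Assumption 3.1 was not used in the previous section.) Recalling
from Proposition 2.5 that $V^x_t$ maps $\mathrm{UC}_b(\Omega)$ to $\mathrm{UC}_b(\Omega_t)$, it follows that the
extension maps $\mathbb{L}^p_{\mathcal{P}_x}$ to $\mathbb{L}^p_{\mathcal{P}_x}(\mathcal{F}_t)$. □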
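
The two elementary inequalities behind this proof can be spelled out as follows. This is only a sketch, under the assumption (suggested by the appeal to "the definition" of the value function) that the value at a fixed path is a supremum of expectations over a family of measures. The symbols $\mathcal{Q}$ and $\Phi$ are placeholders introduced here and are not notation from the paper.

```latex
% Placeholder notation (not from the paper):
%   \Phi(\zeta) := \sup_{Q \in \mathcal{Q}} E^Q[\zeta]  for a family \mathcal{Q} of probability measures.
\begin{align*}
|\Phi(\xi) - \Phi(\psi)|
  &\le \sup_{Q \in \mathcal{Q}} \bigl| E^Q[\xi] - E^Q[\psi] \bigr|
   \le \sup_{Q \in \mathcal{Q}} E^Q\bigl[|\xi - \psi|\bigr]
   = \Phi(|\xi - \psi|), \\
\Phi(|\xi - \psi|)^p
  &= \sup_{Q \in \mathcal{Q}} \bigl( E^Q\bigl[|\xi - \psi|\bigr] \bigr)^p
   \le \sup_{Q \in \mathcal{Q}} E^Q\bigl[|\xi - \psi|^p\bigr]
   = \Phi(|\xi - \psi|^p).
\end{align*}
```

The first line uses $|\sup a - \sup b| \le \sup |a - b|$; the second uses that $y \mapsto y^p$ is increasing on $[0,\infty)$ together with Jensen's inequality under each $Q$, valid for $p \ge 1$.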
4.1 Quasi-Sure Properties of the Extension

In this section, we provide some auxiliary results of a technical nature. The
first one will (quasi-surely) allow us to appeal to the results in the previous
section without imposing Assumption 3.1. This is desirable since we would
like to end up with quasi-sure theorems whose statements do not involve
regular conditional probability distributions.

Lemma 4.4. Assumption 2.1 implies that Assumption 3.1 holds for $\mathcal{P}_x$-
quasi-every $\eta \in \Omega$ satisfying $\eta_0 = x$.

For the proof of this lemma, we shall use the following result. 

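Lemma 4.5. Let $Y$ and $Z$ be continuous adapted processes, $t \in [0,T]$ and
let $P$ be a probability measure on $\Omega$. Then $\overline{\mathbb{F}^Y}^P \supseteq \mathbb{F}^Z$ implies that

\[ \overline{\mathbb{F}^{Y^{t,\omega}}}^{P^{t,\omega}} \supseteq \mathbb{F}^{Z^{t,\omega}} \quad \text{for $P$-a.e. } \omega \in \Omega. \]

Proof. The assumption implies that there exists a progressively measurable
transformation $\beta : \Omega \to \Omega$ such that $Z = \beta(Y)$ $P$-a.s. For $P$-a.e. $\omega \in \Omega$,
it follows that $Z(\omega \otimes_t \cdot) = \beta(Y(\omega \otimes_t \cdot))$ $P^{t,\omega}$-a.s., which, in turn, yields the
result. □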



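Proof of Lemma 4.4. Let $X := X(t,\eta,\nu)$ with $\eta_0 = x$; we have to show that

\[ \overline{\mathbb{F}^{X}}^{P_0^t} \supseteq \mathbb{F}^t \]

whenever $\eta$ is outside some $\mathcal{P}_x$-polar set. Hence we shall fix an
arbitrary $\hat P \in \mathcal{P}_x$ and show that the result holds on a set of full measure $\hat P$.

Let $\hat P \in \mathcal{P}_x$; then $\hat P = P(0,x,\hat\nu)$ for some $\hat\nu \in \mathcal{U}$ and $\hat P$ is concentrated on
the image of $\hat X^0$, where $\hat X := X(0,x,\hat\nu)$. That is, recalling that $\eta_0 = \hat X_0 = x$,
we may assume that $\eta = \hat X(\omega)$ for some $\omega \in \Omega$. Let

\[ \bar\nu(\omega) := 1_{[0,t)}\,\hat\nu(\omega) + 1_{[t,T]}\,\nu(\omega^t). \tag{4.1} \]

Then $\bar X := X(0,x,\bar\nu)$ satisfies $\bar X = \hat X$ on $[0,t]$ and hence we may assume
that $\eta = \bar X(\omega)$ on $[0,t]$. Using Lemma 3.3 and (4.1), we deduce that

\[ \bar X^{t,\omega} = X(t, \bar X(\omega), \bar\nu^{t,\omega}) = X(t,\eta,\nu) = X. \]

Since $\mathbb{F} \subseteq \overline{\mathbb{F}^{\bar X}}^{P_0}$ by Assumption 2.1, we conclude that

\[ \mathbb{F}^t \subseteq \overline{\mathbb{F}^{\bar X^{t,\omega}}}^{P_0^t} = \overline{\mathbb{F}^{X}}^{P_0^t} \]

by using Lemma 4.5 with $Z$ being the canonical process. □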

The next two results show that (for $\mu \equiv 0$ and $\sigma$ positive definite) the
mapping $\xi \mapsto V^x_t(\xi)$ on $\mathbb{L}^1_{\mathcal{P}_x}$ falls into the general class of sublinear expectations
considered in [19], whose techniques we shall apply in the subsequent
section. More precisely, the two lemmas below yield the validity of its main
condition [19, Assumption 4.1].

The following property is known as stability under pasting and is well known
to be important in non-Markovian control. It should not be confused with the
pasting discussed in Remark 3.6, where the considered measures correspond
to different points in time.

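Lemma 4.6. Let $\tau$ be an $\mathbb{F}$-stopping time and let $\Lambda \in \mathcal{F}_\tau$. Let $P, P^1, P^2 \in \mathcal{P}_x$
satisfy $P^1 = P^2 = P$ on $\mathcal{F}_\tau$. Then

\[ \bar P(A) := E^P\bigl[P^1(A \mid \mathcal{F}_\tau) 1_\Lambda + P^2(A \mid \mathcal{F}_\tau) 1_{\Lambda^c}\bigr], \quad A \in \mathcal{F}_T \]

defines an element of $\mathcal{P}_x$.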

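Proof. It follows from the definition of the conditional expectation that $\bar P$ is
a probability measure which is characterized by the properties

\[ \bar P = P \text{ on } \mathcal{F}_\tau \quad \text{and} \quad \bar P^{\tau(\omega),\omega} = \begin{cases} (P^1)^{\tau(\omega),\omega} & \text{for $P$-a.e. } \omega \in \Lambda, \\ (P^2)^{\tau(\omega),\omega} & \text{for $P$-a.e. } \omega \in \Lambda^c. \end{cases} \tag{4.2} \]

Let $\nu, \nu^1, \nu^2 \in \mathcal{U}$ be such that $P = P(0,x,\nu)$ and $P^i = P(0,x,\nu^i)$ for $i = 1, 2$.
Moreover, let $X := X(0,x,\nu)$, define $\bar\nu \in \mathcal{U}$ by

\begin{align*}
\bar\nu_r(\omega) := {} & 1_{[0,\tau(X(\omega)^0))}(r)\,\nu_r(\omega) \\
 & + 1_{[\tau(X(\omega)^0),T]}(r)\bigl[\nu^1_r(\omega) 1_\Lambda(X(\omega)^0) + \nu^2_r(\omega) 1_{\Lambda^c}(X(\omega)^0)\bigr]
\end{align*}

and let $P_* := P(0,x,\bar\nu) \in \mathcal{P}_x$. We show that $P_*$ satisfies the three properties
from (4.2). Indeed, $\bar\nu = \nu$ on $[0,\tau(X^0))$ implies that $P_* = P$ on $\mathcal{F}_\tau$.
Moreover, as in (3.8),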



^r(c.),a; ^ p[r{oj)^x ®Q Lo, zy^C^H")^'^) for P-a.c. uen. 
Similarly as below (3.9), we also have that

-r(XHO),^ = {Co ®,(x(a.)0) •) = (z.i)-(^(-)°)'- for P-a.e. ^ G A. 
Therefore, 

p;(")''^ = P(r(L^), a; »o co, (i.i)-(^H'').-) = (pi)-H.- for P-a.e. ^ G A. 

An analogous argument establishes the third property from (4.2) and we
conclude that $\bar P = P_* \in \mathcal{P}_x$. □

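The second property is the quasi-sure representation of $V^x_t(\xi)$ on $\mathbb{L}^1_{\mathcal{P}_x}$, a
result which will be generalized in Theorem 5.2 below.

Lemma 4.7. Let $t \in [0,T]$ and $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$. Then

\[ V^x_t(\xi) = \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{F}_t, P)} E^{P'}[\xi \mid \mathcal{F}_t] \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x, \tag{4.3} \]

where $\mathcal{P}_x(\mathcal{F}_t, P) := \{P' \in \mathcal{P}_x : P' = P \text{ on } \mathcal{F}_t\}$.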

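Proof. Recall that Lemma 4.4 allows us to appeal to the results of Section 3.

(i) We first prove the inequality "$\le$" for $\xi \in \mathrm{UC}_b(\Omega)$. Fix $P \in \mathcal{P}_x$. We
use Step (ii) of the proof of Theorem 3.2, in particular (3.17), for the special
case $s = 0$ and obtain that for given $\varepsilon > 0$ and $N \ge 1$ there exists a measure
$P_N \in \mathcal{P}_x(\mathcal{F}_t, P)$ such that

\[ V^x_t(\omega) \le E^{P_N}[\xi \mid \mathcal{F}_t](\omega) + \rho(\varepsilon) \quad \text{for $P$-a.e. } \omega \in E^1 \cup \dots \cup E^N, \]

where $V^x_t(\omega) := V^x_t(\xi; \omega)$. Since $\bigcup_{i \ge 1} E^i = \Omega$ $P$-a.s., we deduce that

\[ V^x_t(\omega) \le \sup_{N \ge 1} E^{P_N}[\xi \mid \mathcal{F}_t](\omega) + \rho(\varepsilon) \quad \text{for $P$-a.e. } \omega \in \Omega. \]

The claim follows by letting $\varepsilon \to 0$.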

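(ii) Next, we show the inequality "$\ge$" in (4.3) for $\xi \in \mathrm{UC}_b(\Omega)$. Fix
$P, P' \in \mathcal{P}_x$ and recall that $(P')^{t,\omega} \in \mathcal{P}(t, x \otimes_0 \omega)$ for $P'$-a.e. $\omega \in \Omega$ by
Lemma 3.4. Therefore, (3.2) applied with $s := t$ and $t := T$ yields that

\[ V^x_t(\omega) = V_t(x \otimes_0 \omega) \ge E^{(P')^{t,\omega}}[\xi^{t,\omega}] = E^{P'}[\xi \mid \mathcal{F}_t](\omega) \]

$P'$-a.s. on $\Omega$. If $P' \in \mathcal{P}_x(\mathcal{F}_t, P)$, then $P' = P$ on $\mathcal{F}_t$ and the inequality holds
also $P$-a.s. The claim follows as $P' \in \mathcal{P}_x(\mathcal{F}_t, P)$ was arbitrary.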

(iii) So far, we have proved the result for $\xi \in \mathrm{UC}_b(\Omega)$. The general
case $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$ can be derived by an approximation argument exploiting the
stability under pasting (Lemma 4.6). We omit the details since the proof is
exactly the same as in [17, Theorem 5.4]. □

5 Path Regularity for the Value Process 

In this section, we construct a càdlàg $\mathcal{P}_x$-modification for $V^x(\xi)$; that is, a
càdlàg process $Y^x$ such that $Y^x_t = V^x_t(\xi)$ $\mathcal{P}_x$-q.s. for all $t \in [0,T]$. (Recall
that the initial condition $x \in \mathbb{R}^d$ has been fixed.) To this end, we extend the
raw filtration $\mathbb{F}$ as in [19]: we let $\mathbb{F}^+ = \{\mathcal{F}_{t+}\}_{0 \le t \le T}$ be the minimal right-
continuous filtration containing $\mathbb{F}$ and we augment $\mathbb{F}^+$ by the collection $\mathcal{N}^{\mathcal{P}_x}$
of $(\mathcal{P}_x, \mathcal{F}_T)$-polar sets to obtain the filtration

\[ \mathbb{G} = \{\mathcal{G}_t\}_{0 \le t \le T}, \quad \mathcal{G}_t := \mathcal{F}_{t+} \vee \mathcal{N}^{\mathcal{P}_x}. \]

We note that $\mathbb{G}$ depends on $x \in \mathbb{R}^d$ since $\mathcal{N}^{\mathcal{P}_x}$ does, but for brevity, we shall
not indicate this in the notation. In fact, the dependence on $x$ is not crucial:
we could also work with $\mathbb{F}^+$, at the expense of obtaining a modification which
is $\mathcal{P}_x$-q.s. equal to a càdlàg process rather than being càdlàg itself.

We recall that in the quasi-sure setting, value processes similar to the one
under consideration do not admit càdlàg modifications in general; indeed,
while the right limit exists quasi-surely, it need not be a modification (cf.
[19]). Both the regularity of $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$ and the regularity induced by the SDE
are crucial for the following result.

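Theorem 5.1. Let $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$. There exists a ($\mathcal{P}_x$-q.s. unique) $\mathbb{G}$-adapted
càdlàg $\mathcal{P}_x$-modification $\mathcal{E}^x(\xi) = \{\mathcal{E}^x_t(\xi)\}_{t \in [0,T]}$ of $\{V^x_t(\xi)\}_{t \in [0,T]}$. Moreover,

\[ \mathcal{E}^x_t(\xi) = \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{G}_t, P)} E^{P'}[\xi \mid \mathcal{G}_t] \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x, \tag{5.1} \]

for all $t \in [0,T]$.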

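Proof. In view of Lemmata 4.6 and 4.7, we obtain exactly as in [19, Proposition
4.5] that there exists a $\mathcal{P}_x$-q.s. unique $\mathbb{G}$-adapted càdlàg process $\mathcal{E}^x(\xi)$
satisfying (5.1) and

\[ \mathcal{E}^x_t(\xi) = V^x_{t+}(\xi) := \lim_{r \downarrow t} V^x_r(\xi) \quad \text{$\mathcal{P}_x$-q.s. for all } 0 \le t < T. \tag{5.2} \]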

The observation made there is that (4.3) implies that $V^x(\xi)$ is a $(P, \mathbb{F})$-
supermartingale for all $P \in \mathcal{P}_x$, so that one can use the standard modification
argument for supermartingales under each $P$. This argument, cf.
[6, Theorem VI.2], also yields that $E^P[\mathcal{E}^x_t(\xi) \mid \mathcal{F}_{t+}] \le V^x_t(\xi)$ $P$-a.s. and in
particular

\[ E^P[\mathcal{E}^x_t(\xi)] \le E^P[V^x_t(\xi)] \quad \text{for all } P \in \mathcal{P}_x. \]
Hence, it remains to show that

\[ \mathcal{E}^x_t(\xi) \ge V^x_t(\xi) \quad \text{$\mathcal{P}_x$-q.s.} \tag{5.3} \]

for $t \in [0,T)$, which is the part that is known to fail in a more general setting.
We give the proof in several steps.

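(i) We first show that $\mathcal{E}^x_t$ maps $\mathrm{UC}_b(\Omega)$ to $\mathbb{L}^1_{\mathcal{P}_x}(\mathcal{F}_t)$, and in fact even to
$\mathrm{UC}_b(\Omega_t)$ if a suitable representative is chosen. Let $\xi \in \mathrm{UC}_b(\Omega)$, $r \in (t,T]$ and
set $V^x_r := V^x_r(\xi)$. By Proposition 2.5, there exists a modulus of continuity $\rho$
independent of $r$ such that

\[ |V^x_r(\omega) - V^x_r(\omega')| \le \rho(\|\omega - \omega'\|_r). \]

Hence the $\mathcal{P}_x$-q.s. limit from (5.2) satisfies

\[ |V^x_{t+}(\omega) - V^x_{t+}(\omega')| \le \rho(\|\omega - \omega'\|_r) \quad \text{for all } r \in (t,T] \cap \mathbb{Q}, \ \text{$\mathcal{P}_x$-q.s.} \]

Since $\mathcal{E}^x_t(\xi) = V^x_{t+}(\xi)$, taking the limit $r \downarrow t$ yields that

\[ |\mathcal{E}^x_t(\xi)(\omega) - \mathcal{E}^x_t(\xi)(\omega')| \le \rho(\|\omega - \omega'\|_t) \quad \text{$\mathcal{P}_x$-q.s.} \]

By a variant of Tietze's extension theorem, cf. [16], this implies that $\mathcal{E}^x_t(\xi)$
coincides $\mathcal{P}_x$-q.s. with an element of $\mathrm{UC}_b(\Omega_t)$. In particular, $\mathcal{E}^x_t(\xi) \in \mathbb{L}^1_{\mathcal{P}_x}(\mathcal{F}_t)$.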

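(ii) Next, we show that $\mathcal{E}^x_t$ is Lipschitz-continuous. Let $\xi, \psi \in \mathrm{UC}_b(\Omega)$ and
$t_n \downarrow t$. Using (5.2), Fatou's lemma and Lemma 4.3, we obtain that

\begin{align*}
\|\mathcal{E}^x_t(\xi) - \mathcal{E}^x_t(\psi)\|_{L^1_{\mathcal{P}_x}}
 &= \bigl\| \lim_n |V^x_{t_n}(\xi) - V^x_{t_n}(\psi)| \bigr\|_{L^1_{\mathcal{P}_x}} \\
 &= \sup_{P \in \mathcal{P}_x} E^P\bigl[ \lim_n |V^x_{t_n}(\xi) - V^x_{t_n}(\psi)| \bigr] \\
 &\le \sup_{P \in \mathcal{P}_x} \liminf_n E^P\bigl[ |V^x_{t_n}(\xi) - V^x_{t_n}(\psi)| \bigr] \\
 &\le \|\xi - \psi\|_{L^1_{\mathcal{P}_x}}.
\end{align*}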

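(iii) Let $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$. Then there exist $\xi^n \in \mathrm{UC}_b(\Omega)$ such that $\xi^n \to \xi$ in $L^1_{\mathcal{P}_x}$
and thus $\mathcal{E}^x_t(\xi^n) \to \mathcal{E}^x_t(\xi)$ in $L^1_{\mathcal{P}_x}$ by Step (ii). Since $\mathcal{E}^x_t(\xi^n) \in \mathbb{L}^1_{\mathcal{P}_x}(\mathcal{F}_t)$ by
Step (i) and since $\mathbb{L}^1_{\mathcal{P}_x}(\mathcal{F}_t)$ is closed in $L^1_{\mathcal{P}_x}$, we conclude that

\[ \mathcal{E}^x_t(\xi) \in \mathbb{L}^1_{\mathcal{P}_x}(\mathcal{F}_t) \quad \text{for all } \xi \in \mathbb{L}^1_{\mathcal{P}_x}. \]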

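(iv) Let $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$. Since $V^x_t$ is the identity on $\mathbb{L}^1_{\mathcal{P}_x}(\mathcal{F}_t)$, Step (iii) implies
that $\mathcal{E}^x_t(\xi) = V^x_t(\mathcal{E}^x_t(\xi))$. Moreover, the representations (4.3) and (5.1) yield

\begin{align*}
V^x_t(\mathcal{E}^x_t(\xi))
 &= \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{F}_t, P)} E^{P'}[\mathcal{E}^x_t(\xi) \mid \mathcal{F}_t] \\
 &\ge \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{F}_t, P)} E^{P'}\bigl[ E^{P'}[\xi \mid \mathcal{G}_t] \mid \mathcal{F}_t \bigr] \\
 &= \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{F}_t, P)} E^{P'}[\xi \mid \mathcal{F}_t] \\
 &= V^x_t(\xi) \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x.
\end{align*}

We conclude that (5.3) holds true. □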
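
The supermartingale property invoked at the beginning of this proof can be sketched as follows. This is only an outline of the standard argument, written under the assumption that stability under pasting (Lemma 4.6) makes the family in (4.3) upward directed, so that the essential supremum is attained along an increasing sequence. The times $s \le t$ and the sequence $P_n$ are notation introduced here for illustration.

```latex
% Sketch for 0 \le s \le t \le T and a fixed P \in \mathcal{P}_x.
% Assumption: there exist P_n \in \mathcal{P}_x(\mathcal{F}_t, P) such that
%   E^{P_n}[\xi \mid \mathcal{F}_t] increases to V^x_t(\xi)  P-a.s.
\begin{align*}
E^P\bigl[ V^x_t(\xi) \mid \mathcal{F}_s \bigr]
  &= \lim_{n\to\infty} E^P\bigl[ E^{P_n}[\xi \mid \mathcal{F}_t] \mid \mathcal{F}_s \bigr]
     && \text{(monotone convergence)} \\
  &= \lim_{n\to\infty} E^{P_n}\bigl[ E^{P_n}[\xi \mid \mathcal{F}_t] \mid \mathcal{F}_s \bigr]
     && \text{($P_n = P$ on $\mathcal{F}_t$)} \\
  &= \lim_{n\to\infty} E^{P_n}[\xi \mid \mathcal{F}_s]
     && \text{(tower property)} \\
  &\le \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{F}_s, P)} E^{P'}[\xi \mid \mathcal{F}_s]
   = V^x_s(\xi)
     && \text{($\mathcal{P}_x(\mathcal{F}_t, P) \subseteq \mathcal{P}_x(\mathcal{F}_s, P)$)}.
\end{align*}
```

All relations hold $P$-a.s.; the last inclusion is immediate because agreement with $P$ on $\mathcal{F}_t$ implies agreement on the smaller $\sigma$-field $\mathcal{F}_s$.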



Since $\mathcal{E}^x(\xi)$ is a càdlàg process, its value at a stopping time $\tau$ is well
defined. The following result states the quasi-sure representation of $\mathcal{E}^x_\tau(\xi)$
and the quasi-sure version of the dynamic programming principle in its final
form.

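Theorem 5.2. Let $0 \le \varrho \le \tau \le T$ be $\mathbb{G}$-stopping times and $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$. Then

\[ \mathcal{E}^x_\varrho(\xi) = \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{G}_\varrho, P)} E^{P'}[\mathcal{E}^x_\tau(\xi) \mid \mathcal{G}_\varrho] \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x \tag{5.4} \]

and in particular

\[ \mathcal{E}^x_\varrho(\xi) = \operatorname*{ess\,sup}^{P}_{P' \in \mathcal{P}_x(\mathcal{G}_\varrho, P)} E^{P'}[\xi \mid \mathcal{G}_\varrho] \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x. \]

Moreover, there exists for each $P \in \mathcal{P}_x$ a sequence $P_n \in \mathcal{P}_x(\mathcal{G}_\varrho, P)$ such that

\[ \mathcal{E}^x_\varrho(\xi) = \lim_{n \to \infty} E^{P_n}[\xi \mid \mathcal{G}_\varrho] \quad \text{$P$-a.s.} \]

with a $P$-a.s. increasing limit.

Proof. In view of Lemmata 4.6 and 4.7, the result is derived exactly as in
[19, Theorem 4.9]. □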



As in (3.3), the relation (5.4) can be seen as a semigroup property

\[ \mathcal{E}^x_\varrho(\xi) = \mathcal{E}^x_\varrho(\mathcal{E}^x_\tau(\xi)), \]

at least when $\mathcal{E}^x_\tau(\xi)$ is in the domain $\mathbb{L}^1_{\mathcal{P}_x}$ of $\mathcal{E}^x_\varrho$. The latter is guaranteed by
Lemma 4.3 when $\tau$ is a deterministic time. However, one cannot expect $\mathcal{E}^x_\tau(\xi)$
to be quasi uniformly continuous (cf. Remark 4.1) for a general stopping time,
for which reason we prefer to express the right hand side as in (5.4).

6 Hamilton-Jacobi-Bellman 2BSDE 

In this section, we characterize the value process $\mathcal{E}^x(\xi)$ as the solution of
a 2BSDE. To this end, we first examine the properties of $B$ under a fixed
$P \in \mathcal{P}_x$. The following result is in the spirit of [28, Section 8].

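Proposition 6.1. Let $x \in \mathbb{R}^d$, $\nu \in \mathcal{U}$ and $P := P(0,x,\nu)$. There exists a
progressively measurable transformation $\beta : \Omega \to \Omega$ (depending on $x, \nu$) such
that $W := \beta(B)$ is a $P$-Brownian motion and

\[ \overline{\mathbb{F}}^P = \overline{\mathbb{F}^W}^P. \tag{6.1} \]

Moreover, $B$ is the $P$-a.s. unique strong solution of the SDE

\[ B = \int_0^\cdot \mu(t, x + B, \nu_t(W))\,dt + \int_0^\cdot \sigma(t, x + B, \nu_t(W))\,dW_t \quad \text{under } P. \]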

Proof. Let $X := X(0,x,\nu)$. As in Lemma 3.4, Assumption 2.1 implies the
existence of a progressively measurable transformation $\beta : \Omega \to \Omega$ such
that

\[ \beta(X^0) = B \quad \text{$P_0$-a.s.} \tag{6.2} \]

Let $W := \beta(B)$. Then

\[ (B, X^0)_{P_0} = (\beta(X^0), X^0)_{P_0} = (\beta(B), B)_P = (W, B)_P; \]

i.e., the distribution of $(B, X^0)$ under $P_0$ coincides with the distribution of
$(W, B)$ under $P$. In particular, $W$ is a $P$-Brownian motion. Moreover, we
have $\overline{\mathbb{F}^{X^0}}^{P_0} = \overline{\mathbb{F}^{\beta(X^0)}}^{P_0}$ by Assumption 2.1 and therefore

\[ \overline{\mathbb{F}}^P = \overline{\mathbb{F}^{\beta(B)}}^P = \overline{\mathbb{F}^W}^P, \]

which is (6.1). Note that

\[ X^0 = X^0(B) = \int_0^\cdot \mu(t, x + X^0, \nu_t(B))\,dt + \int_0^\cdot \sigma(t, x + X^0, \nu_t(B))\,dB_t \]

under $P_0$. Let $Y$ be the (unique, strong) solution of the analogous SDE

\[ Y = \int_0^\cdot \mu(t, x + Y, \nu_t(W))\,dt + \int_0^\cdot \sigma(t, x + Y, \nu_t(W))\,dW_t \quad \text{under } P. \]

Using the definition of $P$ and (6.2), we have that

\begin{align*}
(Y, W)_P &= (X^0(W), W)_P \\
 &= (X^0(B), B)_{P_0} \\
 &= (B, \beta(B))_P \\
 &= (B, W)_P.
\end{align*}

In view of (6.1), it follows that $Y = B$ holds $P$-a.s. □



In the sequel, we denote by $M^{B,P}$ the local martingale part in the canonical
semimartingale decomposition of $B$ under $P$.

Corollary 6.2. Let $P \in \mathcal{P}_x$. Then the filtration $\overline{\mathbb{F}}^P$ is right-continuous. If,
in addition, $\sigma$ is invertible, then $(M^{B,P}, P)$ has the predictable representation
property.

The latter statement means that any right-continuous $(\overline{\mathbb{F}}^P, P)$-local martingale
$N$ has a representation $N = N_0 + \int Z\,dM^{B,P}$ under $P$, for some
$\overline{\mathbb{F}}^P$-predictable process $Z$.

Proof. We have seen in Proposition 6.1 that $\overline{\mathbb{F}}^P$ is generated by a Brownian
motion $W$, hence right-continuous, and that $M^{B,P} = \int \hat\sigma_t\,dW_t$ for
$\hat\sigma_t := \sigma(t, x + B, \nu_t(W))$, where $\nu \in \mathcal{U}$. By changing $\hat\sigma$ on a $dt \times P$-nullset,
we may assume that $\hat\sigma$ is $\overline{\mathbb{F}}^P$-predictable. Using the Brownian representation
theorem and $W = \int \hat\sigma^{-1}\,dM^{B,P}$, we deduce that $M^{B,P}$ has the representation
property. □
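
To make the last step of this proof explicit, here is a short sketch of how the Brownian representation transfers to the martingale part of $B$. The integrand $H$ (taken as a row vector) and the shorthand $\hat\sigma_t$ for the volatility $\sigma(t, x + B, \nu_t(W))$ are notation chosen here for illustration.

```latex
% Let N be a right-continuous (\overline{\mathbb{F}}^P, P)-local martingale.
% By Proposition 6.1 the filtration is generated by the Brownian motion W, so the
% Brownian representation theorem provides a predictable row-vector process H with
\begin{align*}
N &= N_0 + \int H_t \, dW_t
   = N_0 + \int H_t \, \hat\sigma_t^{-1} \, dM^{B,P}_t ,
   \qquad \text{using } dW_t = \hat\sigma_t^{-1} \, dM^{B,P}_t , \\
Z &:= H \hat\sigma^{-1}
   \quad \Longrightarrow \quad
   N = N_0 + \int Z \, dM^{B,P} .
\end{align*}
```

Invertibility of $\sigma$ is used exactly once, to recover $W$ from $M^{B,P}$.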

The following formulation of 2BSDE is, of course, inspired by [27].

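Definition 6.3. Let $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$ and consider a pair $(Y, Z)$ of processes with
values in $\mathbb{R} \times \mathbb{R}^d$ such that $Y$ is càdlàg $\mathbb{G}$-adapted while $Z$ is $\mathbb{G}$-predictable
and $\int_0^T |Z_s|^2\,d\langle B \rangle_s < \infty$ $\mathcal{P}_x$-q.s. Then $(Y, Z)$ is called a solution of the
2BSDE (6.3) if there exists a family $(K^P)_{P \in \mathcal{P}_x}$ of $\overline{\mathbb{F}}^P$-adapted increasing
processes satisfying $E^P[|K^P_T|] < \infty$ such that

\[ Y_t = \xi - \int_t^T Z_s\,dM^{B,P}_s + K^P_T - K^P_t, \quad 0 \le t \le T, \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x \tag{6.3} \]

and such that the following minimality condition holds for all $0 \le t \le T$:

\[ \operatorname*{ess\,inf}^{P}_{P' \in \mathcal{P}_x(\mathcal{G}_t, P)} E^{P'}\bigl[ K^{P'}_T - K^{P'}_t \mid \mathcal{G}_t \bigr] = 0 \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x. \tag{6.4} \]

Moreover, a càdlàg process $Y$ is said to be of class $(D, \mathcal{P}_x)$ if the family
$\{Y_\tau\}_\tau$ is uniformly integrable under $P$ for all $P \in \mathcal{P}_x$, where $\tau$ runs through
all $\mathbb{G}$-stopping times. The following is our main result.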

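Theorem 6.4. Assume that $\sigma$ is invertible and let $\xi \in \mathbb{L}^1_{\mathcal{P}_x}$.

(i) There exists a ($dt \times \mathcal{P}_x$-q.s. unique) $\mathbb{G}$-predictable process $Z^\xi$ such that

\[ Z^\xi = \bigl( d\langle B, B \rangle^P \bigr)^{-1} d\langle \mathcal{E}^x(\xi), B \rangle^P \quad \text{$P$-a.s. for all } P \in \mathcal{P}_x. \tag{6.5} \]

(ii) The pair $(\mathcal{E}^x(\xi), Z^\xi)$ is the minimal solution of the 2BSDE (6.3); i.e.,
if $(Y, Z)$ is another solution, then $\mathcal{E}^x(\xi) \le Y$ $\mathcal{P}_x$-q.s.

(iii) If $(Y, Z)$ is a solution of (6.3) such that $Y$ is of class $(D, \mathcal{P}_x)$, then
$(Y, Z) = (\mathcal{E}^x(\xi), Z^\xi)$.

In particular, if $\xi \in \mathbb{L}^p_{\mathcal{P}_x}$ for some $p > 1$, then $(\mathcal{E}^x(\xi), Z^\xi)$ is the unique
solution of (6.3) in the class $(D, \mathcal{P}_x)$.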

Proof. Given two processes which are (càdlàg) semimartingales under all
$P \in \mathcal{P}_x$, their quadratic covariation can be defined $\mathcal{P}_x$-q.s. by using the
integration-by-parts formula and Bichteler's pathwise stochastic integration
[1, Theorem 7.14]; therefore, the right hand side of (6.5) can be used as a
definition of $Z^\xi$. The details of the argument are as in [19, Proposition 4.10].

Let $P \in \mathcal{P}_x$. By Proposition 6.1, $B$ is an Itô process under $P$; in particular,
we have $\langle B, S \rangle^P = \langle M^{B,P}, S \rangle^P$ $P$-a.s. for any $P$-semimartingale $S$.
The Doob-Meyer theorem under $P$ and Corollary 6.2 then yield the decomposition

\[ \mathcal{E}^x(\xi) = \mathcal{E}^x_0(\xi) + \int Z^\xi\,dM^{B,P} - K^P \quad \text{$P$-a.s.} \]

and we obtain (ii) and (iii) by following the arguments in [19, Theorem 4.15].
If $\xi \in \mathbb{L}^p_{\mathcal{P}_x}$ for some $p \in (1, \infty)$, then $\mathcal{E}^x(\xi)$ is of class $(D, \mathcal{P}_x)$ as a consequence
of Jensen's inequality (cf. [19, Lemma 4.14]). Therefore, the last assertion
follows from the above. □
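
For orientation, the passage from the Doob-Meyer decomposition to the backward form (6.3) is the following rearrangement. It assumes that the terminal value of the value process equals $\xi$, which is what one expects from a modification of the value function at time $T$; this is a reading of the construction, not a statement quoted from the proof. The abbreviations in the comment are introduced here.

```latex
% Abbreviations (introduced here): \mathcal{E} := \mathcal{E}^x(\xi), M := M^{B,P}, Z := Z^\xi, K := K^P.
% Starting point: \mathcal{E} = \mathcal{E}_0 + \int Z \, dM - K   P-a.s.
\begin{align*}
\mathcal{E}_T - \mathcal{E}_t
  &= \int_t^T Z_s \, dM_s - (K_T - K_t), \\
\mathcal{E}_t
  &= \mathcal{E}_T - \int_t^T Z_s \, dM_s + K_T - K_t
   = \xi - \int_t^T Z_s \, dM_s + K_T - K_t
   \qquad \text{if } \mathcal{E}_T = \xi .
\end{align*}
```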

We conclude by interpreting the canonical process $B$, seen under the
"set of scenarios" $\mathcal{P}_x$, as a model for drift and volatility uncertainty in the
Knightian sense.

Remark 6.5. Consider the set-valued process

\[ D_t(\omega) := \{ (\mu(t, \omega, u), \sigma(t, \omega, u)) : u \in U \} \subseteq \mathbb{R}^d \times \mathbb{R}^{d \times d}. \]

In view of Proposition 6.1, each $P \in \mathcal{P}_x$ can be seen as a scenario in which
the drift and the volatility (of $B$) take values in $D$, $P$-a.s. Then, the upper
expectation $\mathcal{E}^x(\xi)$ is the corresponding worst-case expectation (see [19] for
a connection to superhedging in finance). Note that $D$ is a random process
although the coefficients of our controlled SDE are non-random. Indeed,
the path-dependence of the SDE translates to an $\omega$-dependence in the weak
formulation that we are considering.

In particular, for $\mu \equiv 0$, we have constructed a sublinear expectation
similar to the random G-expectation of [17]. While the latter is defined by
specifying a set-valued process like $D$ in the first place, we have started here
from a controlled SDE under $P_0$. It seems that the present construction
is somewhat less technical than the one in [17]; in particular, we did not
work with the process $\hat a = d\langle B \rangle_t / dt$ which played an important role in [26]
and [17]. However, it seems that the Lipschitz conditions on $\mu$ and $\sigma$ are
essential, while [17] merely used a notion of uniform continuity.
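
As a toy illustration of the worst-case expectation in the driftless case, the following Python sketch computes an upper expectation over switching volatilities by backward dynamic programming on a random walk. It is not the construction of this paper: it assumes one dimension, zero drift, a finite set of volatility values, a payoff depending only on the terminal value, and a discrete-time approximation. The names `upper_expectation`, `payoff`, `sigmas` and `n_steps` are hypothetical.

```python
import math
from functools import lru_cache


def upper_expectation(payoff, sigmas, T=1.0, n_steps=8):
    """Toy upper expectation sup E[payoff(X_T)] for a driftless random walk.

    Assumption (illustration only): at every step the controller picks one
    volatility from `sigmas`; the walk then moves up or down by
    sigma * sqrt(dt), each with probability 1/2.
    """
    sigmas = tuple(sigmas)
    dt = T / n_steps
    h = math.sqrt(dt)

    @lru_cache(maxsize=None)
    def value(k, counts):
        # counts[i] = (up moves) - (down moves) made so far with sigmas[i],
        # so the current position is determined by `counts` alone.
        x = h * sum(c * s for c, s in zip(counts, sigmas))
        if k == n_steps:
            return payoff(x)
        best = -math.inf
        for i in range(len(sigmas)):
            up = counts[:i] + (counts[i] + 1,) + counts[i + 1:]
            down = counts[:i] + (counts[i] - 1,) + counts[i + 1:]
            # One-step dynamic programming: average the two successors,
            # then maximise over the admissible volatilities.
            best = max(best, 0.5 * (value(k + 1, up) + value(k + 1, down)))
        return best

    return value(0, (0,) * len(sigmas))
```

For the convex payoff `lambda x: x * x` and `sigmas=(0.5, 1.0)`, the recursion selects the largest volatility at every node and returns the largest variance, 1.0 for `T=1.0`, up to rounding. For the concave payoff `lambda x: -x * x` it selects the smallest volatility and returns about -0.25. This reflects the sublinear, worst-case character of the expectation described in the remark.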